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DOMAINS OF EXTRACELLULAR REGION OF HUMAN 
5 PLATELET DERIVED GROWTH FACTOR RECEPTOR POLYPEPTIDES 

FIELD OF THE INVENTION 
The present invention relates to receptors for growth 
factors r particularly to human platelet-derived growth factor 

10 receptors (hPDGF-R) . More particularly , it provides various 
composite constructs of human platelet-derived growth factor 
receptors, these constructs retaining ligand binding regions 
found in the natural extracellular region of the receptors . It 
also provides recombinant nucleic acids encoding these 

15 polypeptides, typically also comprising a promoter for 
expression, and fusion peptides on the amino or carboxy 
terminus of the expressed extracellular composite structure. 
Antibodies are provided which recognize epitopes containing 
amino acids contained in different domains of the extracellular 

20 region. Cells comprising these polypeptides and nucleic acids, 
and diagnostic uses of these reagents are also provided, 

BACKGROUND OF THE INVENTION 
Polypeptide growth factors are mitogens that act on 

25 cells by specifically binding to receptors located on the cell 
plasma membrane. The platelet-derived growth factor (PDGF) 
stimulates a diverse group of biochemical responses, e.g., 
changes in ion fluxes, activation of various kinases, 
alteration of cell shape, transcription of various genes, and 

30 modulation of enzymatic activities associated with phospholipid 
metabolism. See, e.g., Bell et al. (1989) "Effects of Platelet 
Factors on Migration of Cultured Bovine Aortic Endothelial and 
Smooth Muscle Cells," Circulation Research 65:1057-1065. 

Platelet-derived growth factors are found in higher 

35 animals, particularly in warm blooded animals, e.g., mammals. 
In vitro, PDGF is a major polypeptide mitogen in serum for 
cells of mesenchymal origin such as fibroblasts, smooth muscle 
cells,, and glial cells. In vivo, PDGF does not normally 
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Circulate freely in blood, but is stored in the alpha granules 
of circulating blood platelets. During blood clotting and 
platelet adhesion the granules are released, often at sites of 
injured blood vessels, thereby implicating PDGF in the repair 
of blood vessels. PDGF may stimulate migration of arterial 
smooth muscle cells from the medial to the intimal layer of the 
artery where the muscle cells may proliferate. This is liXely 
to be an early response to injury. 

PDGF has also been implicated in wound healing, in 
atherosclerosis, in myeloproliferative disease, and in 
stimulating genes associated with cancerous transformation of 
cells, particularly c-myc and c-fos. 

The platelet-derived growth factor is composed of two 
homologous polypeptide chains; it is a dimer of 16 kilodalton 
proteins which are disulfide connected. These polypeptides are 
of two types, the type B chain and' the type A chain. Three 
forms of the growth factor dimer are found corresponding to a 
homodimer of two type A chains, a homqdimer of two type B 
chains, and a heterodimer of the type A chain with the type B 
chain. Each of these three different combinations is referred 
to as a PDGF isoform. See, for a review on PDGF, Ross et al. 
(1986) "The Biology of Platelet-Derived Growth Factor, " £ell 
46:155-169. The growth factor sequences from mouse and human 
are highly homologous. 

The PDGF acts by binding to the platelet-derived 
growth factor receptor (PDGF-R) . The receptor is typically 
found on cells of mesenchymal origin. The functional receptor 
acts while in a form comprising of two transmembrane 
glycoproteins, each of which is about 180 kilodaltons. Two 
different polypeptides have been isolated, a type B receptor 
polypeptide and a type A receptor polypeptide. 

A sequence of a type B receptor polypeptide of the 
■ mouse platelet-derived growth factor receptor polypeptide is 
published in Yarden et al. (1986) Nature 323:226-232. A 
sequence of an type A human platelet-derived growth factor 
receptor (hPDGF-R) polypeptide is disclosed in Matsui et al. 
(1989) Science 243: 800-803. 
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These PDGF receptors usually have three major 
identifiable regions. The first is a transmembrane region (TM) 
which spans the plasma membrane once, separating the regions of 
the receptor exterior to the cell from the regions interior to 
5 the cell. The second region is an extracellular region (XR) 
which contains the domains that bind the polypeptide growth 
factor (i.e., the ligand binding domains). The third is an 
intracellular region (IR) which possesses a tyrosine kinase 
activity. This tyrosine kinase domain is notable in having an 

10 insert of about 100 amino acids, as compared with most other 
receptor tyrosine kinase domains which are contiguous or have 
shorter insert segments. 

The complete sequences of the human type B and human 
type A receptor polypeptides are reported elsewhere, e.g., 

15 U.S. S.N. 07/309,322, which is hereby incorporated herein by 

reference. However, for many purposes, a smaller or less than 
full length functional protein would be desired. For example, 
smaller molecules may be more easily targeted to areas of 
compromised circulation, or present fewer epitopes or 

20 extraneous domains unrelated to various activities of interest. 
Functional analogues with a slightly modified spectrum of 
activity, or different specificity would be very useful. 

. Thus, the use of new composite constructs exhibiting 
biological activity in common with platelet-derived growth 

25 factor receptor polypeptides will have substantial use as 
research, reagents, diagnostic reagents, and therapeutic 
reagents. In particular, the identification of important 
polypeptide features in the extracellular region of the 
platelet-derived growth factor receptor polypeptides will allow 

30 substitutions and deletions of particular features of the 

domains. Moreover, use of an in vitro assay system provides 
the ability to test cytotoxic or membrane disruptive compounds. 
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SUMMARY OF THE INVENTION 
In accordance with the present invention, defined 
constructs of modified human platelet-derived growth factor 
receptor polypeptides are provided* Extracellular tegion 

5 domain structures are identified and modifications and 
combinatorial rearrangements of the receptor segments are 
furnished. Both cell bound and soluble forms of modified 
segments are made available, as are methods for assays using 
them, thereby allowing for screening of ligand analogues. 

10 The present invention provides a platelet-derived 

growth factor receptor (hPDGF-R) fragment of between about 8 
and 400 amino acids comprising one or more platelet-derived 
growth factor (PDGF) ligand binding regions (LBR* s) from 
extracellular domains Dl, D2, or D3, wherein the fragment binds 

15 a platelet-derived growth factor ligand. Generally, the 
fragment will exhibit a binding affinity of about 5 nM or 
better and will have a sequence of at least about 6 or 8 
contiguous amino acids, preferably at least about 15 or- more 
contiguous amino acids from a domain D3 intra-cysteine region. 

20 The fragment will often lack a transmembrane region. In other 
embodiments, the fragment is soluble, is substantially pure, or 
has at least one ligand binding region derived from a domain 
D3. The fragment may be derived from a type B, or from a type 
A PDGF-R LBR fragment, e.g. , from Table 1 or Table 2. In 

25 particular embodiments, the fragment is selected from the group 
of formulae consisting of: 

a) Xa-Dm-Xc; 

b) Xa-Dm-Xl-Dn-Xc ; 

c) Xa-Dm-Xl-Dn-X2-Dp-Xc; and 
30 d) Xa-Dm-Xl~Dn-X2-Dp-X3-Dq-Xc; 

e) Xa-Dm-Xl-Dn-X2-Dp-X3-Dq-X4-Dr-Xc ? 
where the fragment is not D1-D2-D3-D4-D5; 

each of Xa, XI, X2, X3, and Xc is, if present, a 
polypeptide segment lacking a D domain; and 
35 each of Dm, Dn, Dp, and Dq is, independently of one 

another, selected from the group consisting of Dl, D2, D3, D4, 
and D5. Preferred fragments are selected from the group 
consisting of: 
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a) D1-D2-D3 or D3-D4-D5; and 

b) . D1-D2-D3-D4 or D2-D3-D4-D5. 

The present invention also embraces a soluble human 
platelet-derived growth factor receptor (hPDGF-R) fragment of 
5 between about 10 and 350 amino acids comprising at least one * 
platelet-derived growth factor (PDGF) ligand binding region 
(LBR) from a domain D3, wherein the fragment specifically binds 
to a platelet-derived growth factor ligand. Usually the 
fragment comprises a sequence of at least about 15 contiguous 

10 amino acids from the intra-cysteine portion of domain D3 and 

has a binding affinity of better than about 5 nM. Other useful 
fragment embodiments will be soluble, substantially pure, or a 
type B or type A PDGF-R LBR, e.g., from Table 1 or Table 2. 

The invention also includes nucleic acid sequences, 

15 including those encoding the above described polypeptide 

fragments. Often the nucleic acid sequences incorporate a 
promoter, generally operably linked to the sequence encoding 
the fragments. 

Cells comprising the nucleic acids or peptides of the 
20 invention are also embraced. In particular cell embodiments, 
the cell will be a mammalian cell, and often will contain both 
a nucleic acid and a protein expression product of the nucleic 
acid. 

The compositions described above provide antibodies 
25 which recognize an epitope of a described PDGF-R fragment, but 
not a natural PDGF-R epitope. The antibody will often be a 
monoclonal antibody. 

The present invention also provides a method for 
measuring the PDGF receptor binding activity of a biological 
30 sample comprising the steps of: 

a) contacting an aliquot of a sample to a PDGF ligand in 
the presence of a described PDGF-R fragment in a first 
analysis; 

b) contacting an aliquot of the sample to a PDGF ligand 
35 . in the absence of the PDGF-R fragment in a second analysis? and 

c) comparing the amount of binding in the two analyses. 
In some instances, the PDGF-R fragment is attached to a cell, 
or a solid, substrate, e.g., a microtiter dish. 
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The invention also embraces a method for measuring 
the PDGF ligand content of a biological sample comprising the 
steps of: 

a) contacting an aliquot of the sample to a ligand 
5 binding region (LBR) in the presence of a described PDGF-R 

fragment in a first analysis; 

b) contacting an aliquot of the sample to a LBR in the 
absence of the PDGF-R fragment in a second analysis? and 

c) comparing the amount of binding in the two analyses. 
10 In some embodiments , the contacting steps are performed 

simultaneously. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 illustrates a strategy for oligonucleotide 
15 directed in vitro deletion mutagenesis of soluble hPDGF-R 
extracellular domains. Many of these constructs will be 
soluble peptides, or can be modified to be such. 
The abbreviations used are: 

PR = PDGF-R? intact 
20 P = . PDGF-R ? extracellular region 

TM — transmembrane 
K - kinase 
S ' — signal sequence 
Fig. 2 illustrates the structure of a plasmid derived 
25 form pcDL-Sa2?6 used for expressing various deletion 
polypeptides. 

Fig. 3 illustrates the structure of a plasmid pBJA 
derived from pcDLa296. See Takabe et al. (1988) Mol. Cell. 
Biol. 8:466-472. 
30 1. The pcDL-SRa296 is cut with Xhol. 

2. A polylinker (Xhol-Xbal-Sf il-Notl-EcoRI- 
EcoRV-Hindlll-Clal-Sall) is inserted into 
the Xhol cut vector. 

3. Sail is compatible with the Xhol site; and 
35 generates both a Sail and an Xhol site. 

4. The SV40 16s splice junction is no longer 
present. 



WO 92/13867 PCT/US92/00730 



Fig. 4 illustrates the inhibition of receptor 
phosphorylation by a human type B PDGF receptor polypeptide. 
Labeling with a reagent which binds to phosphorylated tyrosine 
shows that phosphorylation activity is decreased in the 
5 presence of the receptor polypeptide fragment. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

I. General Description 

A. PDGF-R 

10 1. structural features 

a. extracellular domain (XR) 

i. signal sequence 

ii. D domains (Ig-like) 

b. transmembrane segment (TM) 
15 c. intracellular domain (IR) 

i. tyrosine kinase 

ii. insert 
2. function 

a. bind ligands (PDGF analogues) 
20 b. tyrosine kinase activity 

c. bind to PDGF-R peptide (dimer 
formation) 

d. phosphorylated segments 

B. Physiological Functions 
25 1. cellular 

2. tissue differentiation 

3. organismal 

II. Polypeptides 

A. p domains 

30 1. 0-sheet strands 

2. cysteine residues 

B. Soluble Forms, extracellular region 

C. Truncated/Deletion Forms 

D. Fusion Proteins 

35 E * Genetic Variants (site-directed rautagenized) 

F. Compositions Comprising Proteins 

III. Nucleic Acids 

A. Isolated Nucleic Acids 

B. Recombinant Nucleic Acids 

40 C. Compositions Comprising Nucleic A^ids 

IV. Methods for Making PDGF-R Constructs 
A. Protein Purification 

1. affinity with derivatized PDGF 

2. various ligands, same receptor 
45 B. Expression of Nucleic Acids 

C. Synthetic methods 

V. Antibodies 

VI. Methods for Use 
A, Diagnostic 

50 B. Therapeutic 
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I. General Description 

A. Platelet-derived growth factor receptor (PDGF-R) 
The human platelet-derived growth factor receptor 
(hPDGF-R) typically comprises two polypeptides. These 
polypeptides, which may be identical or only slightly 
different, associate during the functional activities of ligand 
binding and transducing of the ligand binding signal into the 
cell. 

The platelet-derived growth factor receptor was 
identified as having a major component of an approximately 180 
kilodalton protein which, is glycosylated* This glycoprotein 
was identified as a platelet-derived growth factor receptor 
polypeptide. Primary structures of two homologous forms of 
polypeptides have been reported. A type B receptor nucleic 
acid and its corresponding polypeptide sequence from mouse are 
reported in Yarden et al. (1986) Nature 323: 226-232? and a 
homologous genetic sequence has been isolated from humans. See 
U.S. S.N. 07/309,322. A human type A receptor sequence is 
reported in Matsui et al. (1989) Science 243: 800-803. 
Although the two different forms of the receptor polypeptides 
are homologous, they are encoded by two separate genes. 

The functional receptor apparently involves a dimer 
of these polypeptides, either homodimers of the type B receptor 
polypeptide or of the type A receptor polypeptide, or a 
heterodimer of the type B receptor polypeptide with an type A 
receptor polypeptide. The specificity of binding of each of 
these forms of the receptor is different for each of the 
different forms of platelet-derived growth factor (PDGF) , the 
AA, BB, or AB forms (from either mouse or human, or presumably 
other mammals) . 

The PDGF-R is a member of a family of related 
receptors. See, e.g., Yarden et al. supra . Each of these 
receptor polypeptides has a hydrophobic membrane spanning 
region (TM for transmembrane) , a large extracellular region 
(XR) with regularly spaced cystine residues, and a cytoplasmic 
intracellular region (IR) having intracellular tyrosine kinase 
activity. The XR of the PDGF-R has a predicted structure 
containing 5 0-strand-rich immunoglobulin (Ig)-like domains. 
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Each of these Ig-like domains consists of about 100 amino 
acids, ranging more specifically from about 88 to about 114 
amino acids, and,, except for the fourth domain, contains 
regularly spaced cysteine residues. Many of the structural 
5 features of the various growth factor receptors are homologous, 
including the mouse and human versions of the PDGF-R. Thus, 
many of the structural features defined herein are shared with 
other related proteins. However, in most cases, the functional 
relationship to particular structural features is unknown. 
10 The intracellular region (IR) is that segment of the 

PDGF-R which is carboxy proximal of the transmembrane (TM) 
segment. The intracellular region is characterized, in part, 
by the presence of a split tyrosine kinase structural domain. 
In the human type B receptor polypeptide, the tyrosine kinase 
15 . domain is about 244 amino acids with an insert of about 104 
amino acids. See Table 1. In the human type A receptor 
polypeptide, the domain is about 244 amino acids long with a 
kinase insert of about 103 amino acids. See Table 2, 
Functionally, this domain is defined, in part, by its tyrosine 
20 kinase activity, typically modulated by ligand binding to 

binding sites found in the extracellular region, and appears to 
function in a dimer state. The substrate for phosphorylation 
includes various tyrosine residues on the accompanying receptor 
polypeptide chain, and other proteins which associate with the 
25 receptor. The tyrosine kinase domain is also defined, in part, 
by its homology to similar domains in other tyrosine kinase 
activity containing proteins. See, e.g.,. Yarden et al. (1986) 
Nature 323:226-232. Each IR segment of the dimerized receptor 
complex appears to phosphorylate specific tyrosine residues on 
30 the other polypeptide chain. 



polypeptides is about 24 or 25 amino acids long and is 
characterized by hydrophobic amino acid residues. These 
segments have sequences characteristic of membrane spanning 
35 segments. In the human type B receptor polypeptide the 
transmembrane region appears about 25 amino acids long 
extending from about val(500) to trp(524), while in the human 
type A receptor polypeptide, the transmembrane segment appears 



Each transmembrane segment of the human receptor 



WO 92/13867 PCT/US92/0O730 

10 

to be about 24 amino acids extending from about leu (502) to 
trp(526). See, e.g., Claesson-Welsh et al. (1989) proc. N^t'j 
Read. Sci. USA . 86:4917-4921, 

A polypeptide or nucleic acid is a "human" sequence 

5 if it is derived from, or originated in part from, a natural 
human source. For example, proteins derived from human cells, 
or originally encoded by a human genetic sequence, will be 
human proteins. A sequence is also human if it is selected on 
the basis of its high similarity to a sequence found in a 

10 natural human sample, or is derived therefrom. 

A fusion polypeptide or nucleic acid is a molecule 
which results from the fusion of segments from sequences which 
are not naturally in continuity with one another. Thus, a 
chimeric protein or nucleic acid is a fusion molecule. A 

15 heterologous protein is a protein originating from a different 
source. - - 

B. Physiological Functions 

The PDGF-R appears to have at least four major 
20 different biological functions. The first is the binding of 
ligands, usually the FDGF mitogenic proteins , or their 
analogues. These ligands and analogues may also serve as 
either agonists or antagonists . The ligand binding sites, made 
up of ligand binding regions (LBR's), are localized in the 
25 extracellular region (XR) . The functional receptor transduces 
a. signal in response to ligand binding, and the resulting 
response is a ligand modulated activity. As the likely ligand 
is a PDGF, or an analogue, the signal will ordinarily be PDGF 
modulated, 

30 A second biological activity relates to the tyrosine 

kinase enzymatic activity. This activity is typically 
activated intracellularly in response to ligand binding. 
However, since these receptors apparently function in a dimeric 
state, the interchain binding interactions may be considered a 

35 third biological activity which* may be mediated by blocking 
agents. Blocking o;r interference with the dimerization 
interactions may be mediated by receptor protein fragments, 
particularly, in the functional ligand binding or tyrosine 
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kinase activities . Thus, the introduction of analogues of the 
receptor domains to natural or other receptor polypeptides may 
serve as an additional means to affect PDGF mediation of ligand 
mediated activities. 
5 The fourth function of the PDGF receptor is as a 

binding substrate for other proteins, e.g., the PI3 kinase. In 
particular, the PDGF receptor is phosphorylated at various 
positions in response to ligand binding or other events. This 
binding interaction activates an enzymatic activity on the part 

10 of the binding protein which activates further cellular or 
metabolic responses. 

The term "ligand" refers to the molecules, usually 
members of the platelet-derived growth factor family, that are 
bound by the ligand binding regions (LBR's). The binding 

15 regions are typically found in the XR. Also, a ligand is a 

molecule that serves either as the natural ligand to which the 
receptor binds, or a functional analogue of a ligand. The 
analogue may serve as an agonist or antagonist. Typically 
ligands will be molecules which share structural features of 

20 natural PDGF, e.g., polypeptides having similar amino acid 

sequences or other molecules sharing molecular features, with a 
ligand. The determination of whether a molecule serves as a 
ligand depends upon the measurement of a parameter or response 
which changes upon binding of that ligand, such as dimerization 

25 or tyrosine kinase activity. See, e.g., Gilman et al. (eds) 
(1990) Goodman and Gilman 1 s: The . Pharmacological Bases of 
Therapeutics . 8th Ed., Pergamon Press, which is incorporated 
herein by reference. 

■ The receptor has ligand binding regions (LBR) , or 

30 regions which are important in determining both affinity and 

specificity of binding of ligand, e.g., PDGF and its analogues. 
The ligand binding regions determine the binding interactions 
between the receptors and ligand. Typically, these regions are 
those contact points between the ligand molecule and the 

35 receptor. These molecular interactions can be determined by 

crystallographic techniques, or by testing which regions of the 
receptor are important in ligand interaction. Various segments 
of the extracellular, region of the PDGF receptor make up the 
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ligand binding regions, while other segments form structural 
segments which spatially orient the LBR's in proper arrangement 
to properly bind the ligands. 

Generally, the fragment will have a sequence of at 
5 least about 6 contiguous amino acids, usually at least about 8 
contiguous amino acids, more usually at least about 10 
contiguous amino acids, preferably at least "about 13 contiguous 
amino acids, and more preferably at least about 15 or more 
contiguous amino acids. Usually, the LBRVs will be located 

10 within the intra-cysteine (or equivalent) residues of each Ig- 
like domain, e.g., domains Dl, D2, D3, D4, and D5. They will 
- be preferably derived from D3 sequences, but Dl and D2 derived 
sequences will also be common. Occasionally, sequences from 
D4, D5, or other proteins will provide LBR function. 

15 The extra-cysteine (or equivalent) regions provide 

structural functions, as will inter-domain spacer segments. 
The intra-cysteine portions, or segments, are indicated in 
Tables 4 and 5, and comprise the segments designated C, C«, C", 
D, and E, along with portions of the B and F segments, as 

20 indicated. The extra-cysteine residues comprise the segments 
designated A and G, and portions of B and F. 

The ligand binding regions as defined, in part, by 
the importance of their presence, or their effect on the 
affinity of PDGF ligand binding. The natural, native full 

25 length PDGF-R binds with a K d of about 0.2 mM. See, e.g., Duan 
et al. (1991) J. Biol. Chem. 266:413-418, which is hereby 
incorporated herein by reference. An LBR is a segment of 
polypeptide whose presence significantly affects ligand 
binding, generally by at least about a factor of two, usually 

30 by at least about a factor of four, more usually by at least a 
factor of about eight, and preferably by at least about a 
factor of twelve or more. A fragment of this invention which 
binds to'the PDGF ligand will generally bind with a K fl of less 
than about 10 jiM, more generally less than about 1 usually 

35 less than about 0.1 /iM, more usually less than about 10 nM, 

preferably less than about 1 nM, and more preferably less than 
about 0.5 nM. 
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An epitope . is an antigenic determinant which 
potentially or actually has elicited an antibody response, it 
may also refer to a structural feature which is defined by an 
antibody binding region, or its equivalent. An epitope need 
5 not necessarily be immunogenic, but will serve as a binding 
site for an antibody molecule or its equivalent. 

II. Polypeptides 

Table 1 discloses the sequence of one allele of a 

10 type B human platelet-derived growth factor receptor 
polypeptide. Both a nucleic acid sequence and its 
corresponding protein sequence are provided. The nucleic acid 
sequence corresponds to Seq. ID No. 1. The amino acid sequence 
corresponds to Seq. ID No. 2. A homologous mouse sequence was 

15 reported in Yarden et al. (1988) Nature 323:226-232. The 
sequence of a mouse PDGF receptor polypeptide also exhibits 
structural features in common with the regions, the domains, 
and the 0-strand segments of the human receptor polypeptides. 
The mouse polypeptides , and those from other related receptors , 

20 will serve as a source of similar domains, homologous ^-strand 
segments, and inter-segment sequences, and sequences of 
homology for general replacement or substitutions. 
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TABLE 1 

Sequence of one type B huran PDGF 
receptor polypeptide allele and protein 



TGTTCTCCTGAGCCTTCAGGAGCCTGC^CCAGTCCTG 



52 



S-S2SSSSSSJSSSSJSSS2 

TTC CTG TCP CTC CTG TTA CTT CTG GAA CCA CAG ATC TCT CAG GGC CTG GTC 288 

£eu Su Su Leu Leu Leu Leu Gl« Pro Gin He Ser Gin Gly Leu Val 2 

GTC ACA CCC CCG GGG CCA GAG CTT GTC CTC AAT GTC TCC AGC ACC TTC GTT 339 

Val Thr Pro Pro Gly Pro Glu Leu Val Leu Asn Val Ser Ser Thr Phe Val 19 

CTG ACC TGC TCG GOT TCA GCT CCG GTG GTG TGG GAA CGG ATG TCC CAG GAG 390 

Leu Thr Cys Ser Gly Ser Ala Pro Val Val Trp Glu Arg Met Ser Gin Glu 36 

CCC CCA CAG GAA ATG GCC AAG GCC CAG GAT GGC ACC TTC TCC AGC GTG CTC 441 

Pro Pro Gin Glu Met Ala Lys Ala Gin Asp Gly Thr Phe Ser Ser Val Leu S3 

ACA CTG ACC AAC CTC ACT GGG CTA GAC. ACG GGA GAA TAC TTT TGC ACC CAC 492 

Thr Leu Thr Asn Leu Thr Gly Leu Asp Thr Gly Glu Tyr Phe Cys Thr His 70 

AAT GAC TCC CGT GGA CTG GAG ACC GAT GAG CGG AAA CGG CTC TAC ATC TTT 543 

Asn Asp Ser Arg Gly Leu Glu Thr Asp Glu- Arg Lys Arg Lau Tyr He Phe 87 

GTG CCA GAT CCC ACC GTG GGC TTC CTC CCT AAT GAT GCC GAG GAA CTA TTC 594 

Val Pro Asp Pro Thr Val Gly Phe Leu Pro Asn Asp Ala Glu Glu Leu Phe 104 

ATC TTT CTC ACG GAA ATA ACT GAG ATC ACC ATT CCA TGC CGA GTA ACA GAC 645 

lie Phe Leu Thr Glu lie Thr Glu He Thr He Pro Cys Arg Val Thr Asp 121 

CCA CAG CTG GTG GTG ACA CTG CAC GAG AAG AAA GGG GAC GTT GCA CTG CCT 696 

Pro Gin Lau Val Val Thr Leu His Glu Lys Lys Gly Asp Val Ala Leu Pro 138 

GTC CCC TAT GAT CAC CAA CGT GGC TTT TCT GOT ATC TTT GAG GAC AGA AGC 747 

val Pro Tyr Asp His Gin Arg Gly Phe Ser Gly He Phe Glu Asp Arg Ser 155 



TAC ATC TGC AAA ACC ACC ATT GGG GAC AGG GAG GTG GAT TCT GAT GCC TAC 798 

Tyr He Cys Lys Thr Thr lie Gly Asp Arg Glu Val Asp Ser Asp Ala Tyr 17Z 

TAT GTC TAC AGA CTC CAG GTG TCA TCC ATC AAC GTC TCT GTG AAC GCA GTG 849 

Tyr Val Tyr Arg Leu Gin Val Ser Ser He Asn Val Ser Val Asn Ala Val 189 

CAG ACT GTG GTC CGC CAG GGT GAG AAC ATC ACC CTC ATG TGC ATT GTG ATC 900 

Gin Thr Val Val Arg Gin Gly Glu Asn He Thr Leu Met Cys He Val He 206 

GGG AAT GAT GTG GTC AAC TTC GAG TGG ACA TAC CCC CGC AAA GAA AGT GGG 951 

Gly Asn Asp Val Val Asn Phe Glu Trp Thr Tyr Pro Arg Lys Glu Ser Gly 223 
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CGG CTG GTG GAG CCG GTG ACT GAC TTC CTC TTG GAT ATG CCT TAC CAC ATC 1002 
Arg Leu Val Glu Pro Val Thr Asp Phe Leu Leu Asp Met Pro Tyr His He 240 

CGC TCC ATC CTG CAC ATC CCC AGT GCC GAG TTA GAA GAC TCG GGG ACC TAC 1053 
Arg Ser He Leu His He Pro Ser Ala Glu Leu Glu Asp Ser Gly Thr Tyr 257 

ACC TGC AAT GTG ACG GAG AGT GTG AAT GAC CAT CAG GAT GAA AAG GCC ATC 1104 
Thr Cys Asn Val Thr Glu Ser Val Asn Asp His Gin Asp Glu Lys Ala He 274 

AAC ATC ACC GTG GTT GAG AGC GGC TAC GTG CGG CTC CTG GGA GAG GTG GGC 1155 
Asn He Thr Val Val Glu Ser Gly Tyr Val Arg Leu Leu Gly Glu val Gly 291 

ACA CTA CAA TTT GCT GAG CTG CAT CGG AGC CGG ACA CTG CAG GTA GTG TTC 1206 
Thr Leu Gin Phe Ala Glu Leu His Arg Ser Arg Thr Leu Gin Val Val Phe 30B 

GAG GCC TAC CCA CCG CCC ACT GTC CTG TGG TTC AAA GAC AAC CGC ACC CTG 1257 

Glu Ala Tyr Pro Pro Pro Thr Val Leu Trp Phe Lys Asp Asn Arg Thr Leu .325 

GGC GAC TCC AGC GCT GGC GAA ATC GCC CTG TCC ACG CGC AAC GTG TCG GAG 13 08 
Gly Asp Ser Ser Ala Gly Glu He Ala Leu Ser Thr Arg Asn Val Ser Glu 342 

ACC CGG TAT GTG TCA GAG CTG ACA CTG GTT CGC GTG AAG GTG GCA GAG GCT 1359 
Thr Arg Tyr Val Ser Glu Leu Thr Leu Val Arg Val Lys Val Ala Glu Ala 359 

GGC CAC TAC ACC ATG CGG GCC TTC CAT GAG GAT GCT GAG GTC CAG CTC TCC 1410 
Gly His Tyr Thr Met Arg Ala Phe His Glu Asp Ala Glu Val Gin Leu Ser 376 

TTC CAG CTA CAG ATC AAT GTC CCT GTC CGA GTG CTG GAG CTA AGT GAG AGC 1461 
Phe Gin Leu Gin He Asn Val Pro Val Arg Val Leu Glu Leu Ser Glu Ser 393 

CAC CCT GAC AGT GGG GAA CAG ACA GTC CGC TGT CGT GGC CGG GGC ATG CCG 1512 
His Pro Asp Ser Gly Glu Gin Thr Val Arg Cys Arg Gly Arg Gly Met Pro 410 

CAG CCG AAC ATC ATC TGG TCT GCC TGC AGA GAC CTC. AAA AGG TGT CCA CGT 1563 
Gin Pro Asn He He Trp Ser Ala Cys Arg Asp Leu Lys Arg Cys Pro Arg 427 

GAG CTG CCG CCC ACG CTG CTG GGG AAC AGT TCC GAA GAG GAG AGC CAG CTG 1614 
Glu Leu Pro Pro Thr Leu Leu Gly Asn Ser Ser Glu Glu Glu Ser Gin Leu 444 

GAG ACT AAC GTG ACG TAC TGG GAG GAG GAG CAG GAG TTT GAG GTG GTG AGC 1665 
Glu Thr Asn Val Thr Tyr Trp Glu Glu Glu Gin Glu Phe Glu Val Val ser 461 

ACA CTG CGT CTG CAG CAC GTG GAT CGG CCA CTG TCG GTG CGC TGC . ACG CTG 1716 

Thr Leu Arg Leu Gin His Val Asp Arg Pro lieu Ser Val Arg Cys Thr Leu 478 

CGC AAC GCT GTG GGC CAG GAC ACG CAG GAG GTC ATC GTG GTG CCA CAC TCC 1767 

Arg Asn Ala Val Gly Gin Asp Thr Gin Glu Val He Val Val Pro His Ser 495 

TTG CCC TTT AAG GTG GTG GTG ATC TCA GCC ATC CTG GCC CTG GTG GTG CTC 1818 
Leii Pro Phe Lys Val Val Val He Ser Ala He Leu Ala Leu Val Val Leu 512 
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ACC ATC ATC TCC CTT ATC ATC CTC ATC ATG CTT TGG CAG AAG AAG CCA CGT 1869 
Thr He lie Ser Leu lie lie Leu lie Met Leu Trp Gin Lys Lys Pro Arg 529 

TAC GAG ATC CGA TGG AAG GTG ATT GAG TCT GTG AGC TCT GAC GGC CAT GAG 1920 
Tyr Glu He Arg Trp Lys Val He Glu Ser Val Ser Ser Asp Gly His Glu 546 

TAC ATC TAC GTG GAC CCC ATG CAG CTG CCC TAT GAC TCC ACG TGG GAG CTG 1971 
Tyr lie Tyr Val Asp Pro Met Gin Leu Pro Tyr Asp Ser Thr Trp Glu Leu 563 

CCG CGG GAC CAG CTT GTG CTG GGA CGC ACC CTC GGC TCT GGG GCC TTT GGG 2022 
Pro Arg Asp Gin Leu Val Leu Gly Arg Thr Leu Gly Ser Gly Ala Phe Gly 580 

CAG GTG GTG GAG GCC ACA GCT CAT GGT CTG AGC CAT TCT CAG GCC ACG ATG 2073 
Gin Val Val Glu Ala Thr Ala His Gly Leu Ser His Ser Gin Ala Thr Met 597 

AAA GTG GCC GTC AAG ATG CTT AAA TCC ACA GCC CGC. AGC ACT GAG AAG CAA 2124 
Lys Val Ala Val Lyc Met Leu Lys Ser Thr Ala Arg Ser Ser Glu Lys Gin 614 

GCC CTT ATG TCG GAG CTG AAG ATC ATG AGT CAC CTT GGG CCC CAC CTG AAC 2175 
Ala Leu Met Ser Glu Leu Lys He Met Ser His Leu Gly Pro His Leu Asn 631 

GTG GTC AAC CTG TTG GGG GCC TGC ACC AAA GGA GGA CCC ATC TAT ATC ATC 2226 
Val Val Asn Leu Leu Gly Ala Cys Thr Lys Gly Gly Pro lie Tyr He He 648 

ACT GAG TAC TGC CGC TAC GGA GAC CTG GTG GAC TAC CTG CAC CGC AAC AAA 2277 
Thr Glu Tyr Cys Arg Tyr Gly Asp Leu Val Asp Tyr Leu His Arg Asn Lys 665 

CAC ACC TTC CTG CAG CAC CAC TCC GAC AAG CGC CGC CCG CCC AGC GCG GAG 2328 
His Thr Phe Leu Gin His His Ser Asp Lys Arg Arg Pro Pro Ser Ala Glu 682 

CTC TAC AGC AAT GCT CTG CCC GTT GGG CTC CCC CTG CCC AGC CAT GTG TCC 2379 
Leu Tyr Ser Asn Ala Leu Pro Val Gly Leu Pro Leu Pro Ser His Val Ser 699 

TTG. ACC GGG GAG AGC GAC GGT GGC TAC ATG GAC ATG AGC AAG GAC GAG TCG 2430 
Leu Thr Gly Glu Ser Asp Gly Gly Tyr Met Asp Met Ser Lys Asp Glu Ser 716 

GTG GAC TAT GTG CCC ATG CTG GAC ATG AAA GGA GAC GTC AAA TAT GCA GAC 2481 
Val Asp Tyr Val Pro Met Leu Asp Met Lys Gly Asp Val Lys Tyr Ala Asp 733 

ATC GAG TCC TCC AAC TAC ATG GCC CCT TAC GAT AAC TAC GTT CCC TCT GCC 2532 
He Glu Ser Ser Ash Tyr Met Ala Pro Tyr Asp Asn Tyr Val Pro Ser Ala 750 

CCT GAG AGG ACC TGC CGA GCA ACT TTG ATC AAC GAG TCT CCA GTG CTA AGC 2583 
Pro Glu Arg Thr Cys Arg Ala Thr Leu He Asn Glu Ser Pro Val Leu Ser 767 

TAC ATG GAC CTC GTG GGC TTC AGC TAC CAG GTG GCC AAT GGC ATG GAG TTT 2634 

Tyr Met Asp Leu Val. Gly Phe Ser Tyr Gin Val Ala Asn Gly Met Glu Phe 784 
CTG GCC TCC AAG AAC TGC GTC CAC AGA GAC CTG GCG GCT AGG AAC GTG CTC 2685 

Leu Ala Ser Lys Asn Cys Val His Arg Asp Leu Ala Ala Arg Asn Val Leu 801 
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ATC TGT GAA GGC AAG CTG GTC AAG ATC TGT GAC TTT GGC CTG GCT CGA GAC 2736 
lie cys Glu Gly Lys Leu Val Lys lie Cys Asp Phe Gly Leu Ala Arg Asp 818 

ATC ATG CGG GAC TCG AAT TAC ATC TCC AAA GGC AGC ACC TTT TTG CCT TTA 2787 
lie Met "Arg Asp Ser Asn Tyr lie Ser Lys Gly ser Thr Phe Leu Pro Leu 835 

AAG TGG ATG GCT CCG GAG AGC ATC TTC AAC AGC CTC TAC ACC ACC CTG AGC 2838 
Lys Trp Met Ala Pro Glu Ser lie Phe Asn Ser Leu Tyr Thr Thr Leu Ser 852 

GAC GTG TGG TCC TTC GGG ATC CTG CTC TGG GAG ATC TTC ACC TTG GGT GGC 2889 
Asp Val Trp Ser Phe Gly lie Leu Leu Trp Glu He Phe Thr Leu Gly Gly 869 

ACC CCT TAC CCA GAG CTG CCC ATG AAC GAG CAG TTC TAC AAT GCC ATC AAA 2940 

Thr Pro Tyr Pro Glu Leu Pro Met Asn Glu Gin Phe Tyr Asn Ala lie Lys 886 

CGG GGT TAC CGC ATG GCC CAG CCT GCC CAT GCC TCC GAC GAG ATC TAT GAG 2991 
Arg Gly Tyr Arg Met Ala Gin Pro Ala His Ala Ser Asp Glu He Tyr Glu 903 

ATC ATG CAG AAG TGC TGG GAA GAG AAG TTT GAG ATT CGG CCC CCC TTC TCC 3042 

He Met Gin Lys Cys Trp Glu Glu Lys Phe Glu He Arg Pro Pro Phe Ser 920 

CAG CTG GTG CTG CTT CTC GAG AGA CTG TTG GGC GAA GGT TAC AAA AAG AAG 3093 
Gin Leu Val Leu Leu Leu Glu Arg Leu Leu Gly Glu Gly Tyr Lys Lys Lys 937 

TAC CAG CAG GTG GAT GAG GAG TTT CTG AGG AGT GAC CAC CCA GCC ATC CTT 3144 
Tyr Gin Gin Val Asp Glu Glu Phe Leu Arg Ser Asp His Pro Ala He Leu 954 

CGG TCC _CAG GCC CGC TTG CCT . GGG TTC CAT GGC CTC CGA TCT CCC CTG GAC 3195 
Arg Ser Gin Ala Arg Leu Pro Gly Phe His Gly Leu Arg Ser Pro Leu Asp 971 

ACC AGC TCC GTC CTC TAT ACT GCC GTG CAG CCC AAT GAG GGT GAC AAC GAC 324 6 
Thr Ser Ser Val Leu Tyr Thr Ala Val Gin Pro Asn Glu Gly Asp Asn Asp 989 

TAT ATC ATC CCC CTG CCT GAC CCC AAA CCT GAG GTT GCT GAC GAG GGC CCA 3297 
Tyr He He Pro Leu Pro Asp Pro Lys Pro Glu Val Ala Asp Glu Gly Pro 1005 

CTG GAG GGT TCC CCC AGC CTA GCC AGC TCC ACC CTG AAT GAA GTC AAC ACC 3348 
Leu Glu Gly Ser Pro Ser Leu Ala Ser Ser Thr Leu Asn Glu Val Asn Thr 1022 

TCC TCA ACC ATC TCC TGT GAC AGC CCC CTG GAG CCC CAG GAC GAA CCA GAG 3399 
Ser Ser Thr He Ser cys Asp Ser Pro Leu Glu Pro Gin Asp Glu Pro Glu 1039 

CCA GAG CCC CAG CTT GAG CTC CAG GTG GAG CCG GAG CCG GAG CTG GAA CAG 3450 
Pro Glu Pro Gin Leu Glu Leu Gin Val Glu Pro Glu Pro Glu Leu Glu Gin 1056 

TTG CCG GAT TCG GGG TGC CCT GCG CCT CGG GCG GAA GCA GAG GAT AGC TTC 3501 
Leu Pro Asp Ser Gly Cys Pro Ala Pro Arg Ala Glu Ala Glu Asp Ser Phe 1073 



CTG TAGGGGGCTGGCCCCTACCCTGCCCTGCCTGAAGCTCCCCCGCTGCCAGCACCCAGCATCTCC 3 567 
Leu 1074 
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TGGCCTGGCCTGGCCGGGCTTCCTGTCAGCCAGGCTGCCCTTMCAGCTCT 
m^crcCTGACGTGTTC^ 

C^A^^TGGGAAA^ 

rrr^AGACCGAATCCCT^ 



TTTCO 




TCTGTTCAGCCAGCTACCCCTCAAGGAAT^ 

Saagacsaccctagcctccc^ 

WoLGAAGAAAGCOtfnrCT^^ 



w^wCJUSAAC^GCa^CTCCTCCCTATGATC 
GGGGCCMTAGGCAGCCTAATTAATGCTGGAGGCTGAGCCAAGTACAGGA 

CCTTCCCCAGGGCACTTGGAGaCACGCAGCCATAG 
TCAGTCCTGGGGCTTTTTCTTTATCACCCTCAG 
CGGGCCXXGCATCTGTGATGAGAATGTAAATGTGCCAGTGT 
ATGGCCCTGGCTCTGCATTGGACCrrcCT^ 



CTAGAGTATCCAGGTGGTTGCACATTTGTCCAGATC 

TGGGGGTCAGCTGGGCTCCTGGGAGATTCCAGA^ 

CCCCTTCCCCAGGCXICCaGCAAGTCTCAAGAACACAGCPGCACAGG 

CGGTGTCCTGGAAAGCCCCCAGCAGCTGCCCCAGGGACATGGGAAGACCACGGGACCTCTCTCACT 

CCCACGATGACCTCCGGGGGTATCCTGGGCAAAAGGGACAAAGAGGGCAAATC^ 
AGCCCACCACTCCAGCACCTGTGCCGAGGTCTGCGTCGAAGACAGAATGGACAGTGAGGACAG^ 

GTCTTGOAAAAGAaUVGAAGCTTCAGATGGGTACCCCAAGAAGGA 

GGTTTGCCCCTCACCCACCAGCTGCCCCATCCCTGAGGCAGCGCTCCATGGGGG 

CTGCCCAGACCTAGCAGTGACATCTCATTOTCCCCAGCC^ 

CAGGGTTGTAGCCAAGACGCCCCCGCACGGGGAGGGTTGGGAAGGGGGTGCAGGAAGCTCAACCCCT 

CTGGGCACCAACCCrrGCATTGCAGGTTGGCACCTTACTTCCCTGGGATC 

GGGAGAGTWJGTTCTCAATACGGTACCAAAGATATAATCACCTAGGTCT 

CACGTTAAGTCACATTTATACAG CAGAAATGCTATTTTGTATG CXGTTAAGTTTTTCTATCTGTGTA 
CTTTTTTCTAAGGGAAAGATTTTAATATTAAACCTGGTGCTTCTCACTCAC 

^•7 



3634 

3701 

376B 

3835 

3902 

3969 

4036 

4103 

4170 

4237 

4304 

4371 

4438 

4505 

4572 

4639 

4706 

4773 

4840 

4907 

4974 

5041 

5108 

5175 

5242 

5309 

5376 

5427 
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type A human platelet-derived growth factor receptor 
polypeptide. Both a nucleic acid sequence and its 
corresponding protein sequence are provided* The nucleic acid 
sequence corresponds to Seq. ID No. 5. The amino acid sequence 
corresponds to Seq. ID No* 4. Another human type A allele 
sequence is reported in Matsui et al. (1989) Science 243:800- 
803. 
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Sequence of a human type A 
PDGF receptor polypeptide allele and protein 

TTGGAGCTACAGGGAGAGAAACAGAGGAGGAGACTG 61 

acectcttS^tgtgtggg^ 128 

ATG GGG ACT TCC CAT CCG GCG TTC CTG GTC TTA GGC TGT CTT CTC ACA GGG 179 
Met 3y Thr Ser His Pro Ala Phe Leu Val Leu Gly Cys Leu Leu Thr Gly -7 

ffrs AGC CTA ATC CTC TGC CAG CTT TCA TTA CCC TCT ATC CTT CCA AAT GAA 
Ser 2u lie £eu Cys Gin Leu Ser Leu Pro Ser lie Leu Pro Asn Glu 

AAT GAA AAG GTT GTG CAG CTG AAT TCA TCC TTT TCT CTG AGA TGC TTT GGG 
i£ Glu lys val Val Gin Leu Asn Ser Ser Phe Ser Leu Arg Cys Phe Gly 

GAG AGT GAA GTG AGC TGG CAG TAC CCC ATG TCT GAA GAA GAG AGC TCC GAT 

Su Ser Glu Val Ser Trp Gin Tyr Pro Met Ser Glu Glu Glu Ser Ser Asp 

GTG GAA ATC AGA AAT GAA GAA AAC AAC AGC GGC CTT TTT GTG ACG GTC TTG 
Val Glu lie Arg Asn Glu Glu Asn Asn Ser Gly Leu Phe Val Thr Val Leu 

GXA GTG AGC AGT GCC TCG GCG GCC CAC ACA GGG TTG TAC ACT TGC TAT TAC 
Val Ser Ser Ala Ser Ala Ala His Thr Gly Leu Tyr Thr Cys Tyr Tyr 

AAC CAC ACT CAG ACA GAA GAG AAT GAG CTT GAA GGC AGG CAC ATT TAC ATC 
Asn His Thr Gin Thr Glu Glu Asn Glu Leu Glu Gly Arg His He Tyr. He 

TAT GTG CCA GAC CCA GAT GTA GCC TTT GTA CCT CTA GGA ATG ACG GAT TAT 
Tyr Val Pro Asp Pro Asp Val Ala Phe Val Pro Leu Gly Met Thr Asp Tyr 

TTA GTC -ATC GTG GAG GAT GAT GAT TCT GCC ATT ATA CCT TGT CGC ACA ACT 
Leu Val He Val Glu Asp Asp Asp Ser Ala He He Pro Cys Arg Thr Thr 

GAT CCC GAG ACT CCT GTA ACC TTA CAC AAC AGT GAG GGG GTG GTA CCT GCC 
Asp Pro Glu Thr Pro Val Thr Leu His Asn Ser Glu Gly Val Val Pro Ala 

TCC TAC GAC AGC AGA CAG GGC TTT AAT GGG ACC TTC ACT GTA GGG CCC TAT 
Ser Tyr Asp Ser Arg Gin Gly Phe Asn Gly Thr Phe Thr val Gly Pro Tyr 

ATC TGT GAG GCC ACC GTC AAA GGA AAG AAG TTC CAG ACC ATC CCA TTT AAT 
' He Cys Glu Ala Thr Val Lys Gly Lys Lys Phe Gin Thr He Pro Phe Asn 

GTT TAT GCT TTA AAA GCA ACA TCA GAG CTG GAT CTA GAA ATG GAA GCT CTT 
Val Tyr Ala Leu Lys Ala Thr Ser Glu Leu Asp Leu Glu Met Glu Ala Leu 

AAA ACC GTG TAT AAG TCA GGG GAA ACG ATT GTG GTC ACC TGT GCT GTT TTT 
lys Thr Val Tyr Lys Ser Gly Glu Thr He Val Val Thr cys Ala Val Phe 

AAC AAT GAG GTG GTT GAC CTT CAA TGG ACT TAC CCT GGA GAA GTG AAA GGC 
Asn Asn Glu. Val Val Asp Leu Gin Trp Thr Tyr Pro Gly Glu Val Lys Giy 



230 
11 

281 

28 . 

332 
45 

383 
62 

434 
79 

485 
96 

536 
113 

587 
130 

638 
147 

689 
164 

740 
181 

791 
198 

842 
215 

893 
232 
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AAA GGC ATC ACA ATG CTG GAA GAA ATC AAA GTC CCA TCC ATC AAA TTG GTG .944 
Lys Gly lie Thr Met Leu Glu Glu lie Lys Val Pro Ser lie Lys Leu Val 249 

TAC ACT TTG ACG GTC CCC GAG GCC ACG GTG AAA GAC AGT GGA GAT TAC GAA 995 
Tyr Thr Leu Thr Val Pro Glu Ala Thr Val Lys Asp Ser Gly Asp Tyr Glu 266 

TGT GCT GCC CGC CAG GCT ACC AGG GAG GTC AAA GAA ATG AAG AAA GTC ACT 1046 
Cys Ala Ala Arg Gin Ala Thr Arg Glu Val Lys Glu Met Lys Lys Val Thr 283 

ATT. TCT GTC CAT GAG AAA GGT TTC ATT GAA ATC AAA CCC ACC TTC AGC CAG 1097 
lie Ser Val His Glu Lys Gly Phe lie Glu lie Lys Pro Thr Phe ser Gin 300 

TTG GAA GCT GTC AAC CTG CAT GAA GTC AAA CAT TTT GTT GTA GAG GTG CGG 1148 
Leu Glu Ala Val Asn Leu His Glu Val Lys His Phe val Val Glu Val Arg 317 

GCC TAC CCA CCT CCC AGG ATA TCC TGG CTG AAA AAC AAT CTG ACT CTG ATT 1199 
Ala Tyr Pro Pro Pro Arg lie Ser Trp Leu Lys Asn Asn Leu Thr Leu lie 334 

GAA AAT CTC ACT GAG ATC ACC ACT GAT GTG GAA AAG ATT CAG GAA ATA AGG 1250 
Glu Ash Leu Thr Glu lie Thr Thr Asp Val Glu Lys lie Gin Glu He Arg 351 

TAT CGA AGC AAA TTA AAG CTG ATC CGT GCT AAG GAA GAA GAC AGT GGC CAT 1301 
Tyr Arg Ser Lys Leu Lys Leu He Arg Ala Lys Glu Glu Asp Ser Gly His 368 

TAT ACT ATT GTA GCT CAA AAT GAA GAT GCT GTG AAG AGC TAT ACT TTT GAA 1352 
Tyr Thr He Val Ala Gin Asn Glu Asp Ala Val Lys Ser Tyr Thr Phe Glu 385 

CTG TTA ACT CAA GTT CCT TCA TCC ATT CTG GAC TTG GTC GAT GAT CAC CAT 1403 
Leu Leu Thr Gin Val Pro Ser Ser He Leu Asp Leu Val Asp Asp His His 402 

GGC TCA ACT GGG GGA CAG ACG GTG AGG TGC ACA GCT GAA GGC ACG CCG CTT 1454 
Gly Ser Thr Gly Gly Gin Thr Val Arg Cys Thr Ala Glu Gly Thr Pro Leu 419 

CCT GAT ATT GAG TGG ATG ATA TGC AAA GAT ATT AAG AAA TGT AAT AAT GAA 1505 
Pro Asp He Glu Trp Met He Cys Lys Asp He Lys Lys Cys Asn Asn Glu 436 

ACT TCC TGG ACT ATT TTG GCC AAC AAT GTC TCA AAC ATC ATC ACG . GAG ATC 1556 
Thr Ser Trp Thr lie Leu Ala Asn Asn Val Ser Asn He He Thr Glu He 453 

CAC TCC CGA GAC AGG AGT ACC GTG GAG GGC CGT GTG ACT TTC GCC AAA GTG 1607 

His Ser Arg Asp Arg Ser Thr Val Glu Gly Arg Val Thr Phe Ala Lys Val 470 

. GAG GAG ACC ATC GCC GTG CGA TGC CTG GCT AAG AAT CTC CTT GGA GCT GAG 1658 
Glu Glu Thr lie Ala Val Arg Cys Leu Ala Lys Asn Leu Leu Gly Ala Glu 4B7 

AAC CGA GAG CTG AAG CTG GTG GCT CCC ACC CTG CGT TCT GAA CTC ACG GTG 1709 

Asn Arg Glu Leu Lys Leu Val Ala Pro Thr Leu Arg Ser Glu Leu Thr Val 504 

GCT GCT GCA GTC CTG GTG CTG TTG GTG ATT GTG ATC ATC TCA CTT ATT GTC 1760 
Ala Ala Ala Val Leu Val Leu Leu Val He Val He, He Ser Leu He Val 521 
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CTG GTT GTC ATT TGG AAA CAG AAA CCG AGG TAT GAA ATT CGC TGG AGG GTC 1811 

Leu Val Val lie Trp Lys Gin Lys Pro Arg Tyr Glu lie Arg Trp Arg Val 538 

ATT GAA TCA ATC AGC CCA GAT GGA CAT GAA TAT ATT TAT GTG GAC CCG ATG 1862 
He Gin ser He Ser Pro Asp Gly His Glu Tyr He Tyr Val Asp Pro Met 555 

CAG CTG CCT TAT GAC TCA AGA TGG GAG TTT CCA AGA GAT GGA CTA GTG CTT. 1913 
Gin Leu Pro Tyr Asp Ser Arg Trp Glu Phe Pro Arg Asp Gly Leu Val Leu 572 

GGT CGG GTC TTG GGG TCT GGA GCG TTT GGG AAG GTG GTT GAA GGA ACA GCC 1964 
Gly Arg Val Leu Gly Ser Gly Ala Phe Gly Lys Val Val Glu Gly Thr Ala 589 

TAT GGA TTA AGC CGG TCC CAA CCT GTC ATG AAA GTT GCA GTG AAG ATG CTA 2015 
Tyr Gly Leu Ser Arg Ser Gin Pro Val Met Lys Val Ala Val Lys Met Leu 606 

AAA CCC ACG GCC AGA TCC AGT GAA AAA CAA GCT CTC ATG TCT GAA CTG AAG 2066 
l£s Pro Thr Ala Arg ser Ser Glu Lys Gin Ala Leu Met Ser Glu Leu Lys 623 

ATA ATG ACT CAC CTG GGG CCA CAT TTG AAC ATT GTA AAC TTG CTG GGA GCC 2117 
He Met Thr His Leu Gly Pro His Leu Asn He Val Asn Leu Leu Gly Ala 640 

TGC ACC AAG TCA GGC CCC ATT TAC ATC ATC ACA GAG TAT TGC TTC TAT GGA 2168 
cys Thr Lys Ser Gly Pro He Tyr He He Thr Glu Tyr Cys Phe Tyr Gly 657 

GAT TTG GTC AAC TAT TTG CAT AAG AAT AGG GAT AGC TTC CTG AGC CAC CAC 2219 
Asp Leu Val Asn Tyr Leu His Lys Asn Arg Asp Ser Phe Leu Ser His His 674 

CCA GAG AAG CCA AAG AAA GAG CTG GAT ATC TTT GGA TTG AAC CCT GCT GAT 2270 
Pro Glu Lys Pro Lys Lys Glu Leu Asp He Phe Gly Leu Asn Pro Ala Asp 691 

GAA AGC ACA CGG AGC TAT GTT ATT TTA TCT TTT GAA AAC AAT GGT GAC TAC 2321 
Glu Ser Thr Arg Ser Tyr Val He Leu Ser Phe Glu Asn Asn Gly Asp Tyr 708 

ATG GAC ATG AAG CAG GCT GAT ACT ACA CAG TAT GTC CCC ATG CTA GAA AGG 2372 
Met Asp Met Lys Gin Ala Asp Thr Thr Gin Tyr Val Pro Met Leu Glu Arg 725 

AAA GAG GTT TCT AAA TAT TCC GAC ATC CAG AGA TCA CTC TAT GAT CGT CCA 2423 
Lys Glu val Ser Lys Tyr Ser Asp lie Gin Arg Ser Leu Tyr Asp Arg Pro 742 

GCC TCA TAT AAG AAG AAA TCT ATG TTA GAC TCA GAA GTC AAA AAC CTC CTT 2474 
Ala Ser Tyr Lys Lys Lys Ser Met Leu Asp Ser Glu Val Lys Asn Leu Leu 759 

TCA GAT GAT AAC TCA GAA GGC CTT ACT TTA TTG GAT TTG TTG AGC TTC ACC 2525 
Ser Asp Asp Asn Ser Glu Gly Leu Thr Leu Leu Asp Leu Leu Ser Phe Thr 776 

TAT CAA GTT GCC CGA GGA ATG GAG TTT TTG GCT TCA AAA AAT TGT GTC CAC 2576 
^yr Gin Val Ala Arg Gly Met Glu Phe Leu Ala Ser Lys Asn Cys Val Hxs 793 

CGT GAT CTG GCT GCT CGC AAC GTT CTC CTG GCA CAA GGA AAA ATT GTG AAG 2627 
Arg Asp Leu Ala Ala Arg Asn Val Leu Leu Ala Gin Gly Lys He Val Lys 810 
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ATC TGT GAC TTT GGC CTG GCC AGA GAC ATC ATG CAT GAT TCG AAC TAT GTG 2678 
lie Cys Asp Phe Gly Leu Ala Arg Asp He Met His Asp Ser Asn Tyr Val 827 

TCG AAA GGC AGT ACC TTT CTG CCC GTG AAG TGG ATG GCT CCT GAG AGC ATC 2729 
Ser Lys Gly Ser Thr Phe Leu Pro Val Lys Trp Met Ala Pro Glu Ser He 844 

TTT GAC AAC CTC TAC ACC ACA CTG AGT GAT GTC TGG TCT TAT GGC ATT CTG 2780 
Phe Asp Asn Leu Tyr Thr Thr Leu Ser Asp Val Trp Ser Tyr Gly He Leu 861 

CTC TGG GAG ATC TTT TCC CTT GGT GGC ACC CCT TAC CCC GGC ATG ATG GTG 2831 
Leu Trp Glu He Phe Ser Leu Gly Gly Thr Pro Tyr Pro Gly Met Met Val 878 

GAT TCT ACT TTC TAC AAT AAG ATC AAG AGT GGG TAC CGG ATG GCC AAG CCT 2882 
Asp Set Thr Phe Tyr Asn Lys lie Lys Ser Gly Tyr Arg Met Ala Lys Pro 895 

GAC CAC GCT ACC AGT GAA GTC TAC GAG ATC ATG GTG AAA TGC TGG AAC AGT 2933 
Asp His Ala Thr Ser Glu Val Tyr Glu He Met Val Lys cys Trp Asn Ser 912 

GAG CCG GAG AAG AGA CCC TCC TTT TAC CAC CTG AGT GAG ATT GTG GAG AAT 2984 
Glu Pro Glu Lys Arg Pro Ser Phe Tyr His Leu Ser Glu He Val Glu Asn 929 

CTG CTG CCT GGA CAA TAT AAA AAG AGT TAT GAA AAA ATT CAC CTG GAC TTC 3035 
Leu Leu Pro Gly Gin Tyr Lys Lys Ser Tyr Glu Lys He His Leu Asp Phe 946 

CTG AAG AGT GAC CAT CCT GCT GTG GCA CGC ATG CGT GTG GAC TCA GAC AAT 3086 
Leu Lys Ser Asp His Pro Ala Val Ala Arg Met Arg Val Asp Ser Asp Asn 963 

GCA TAC ATT GGT GTC ACC TAC AAA AAC GAG GAA GAC AAG CTG AAG GAC TGG 3137 
Ala Tyr lie Gly Val Thr Tyr Lys Asn Glu Glu Asp Lys Leu Lys Asp Trp 980 

GAG GGT GGT CTG GAT GAG CAG AGA CTG AGC GCT GAC, AGT GGC TAC ATC ATT 3188 
Glu Gly Gly Leu Asp Glu Gin Arg Leu Ser Ala Asp Ser Gly Tyr He He 997 

CCT CTG CCT GAC ATT GAC CCT GTC CCT GAG GAG GAG GAC CTG GGC AAG AGG 3239 

Pro Leu Pro Asp He Asp Pro Val Pro Glu Glu Glu Asp Leu Gly Lys Arg 1014 

AAC AGA CAC AGC TCG CAG ACC TCT GAA GAG AGT GCC ATT GAG ACG GGT TCC 3290 
Asn Arg His Ser Ser Gin Thr Ser Glu Glu Ser Ala He Glu Thr Gly Ser 1031 

AGC AGT TCC ACC TTC ATC AAG AGA GAG GAC GAG ACC ATT GAA GAC ATC GAC 3341 
Ser Ser Ser Thr Phe He Lys Arg Glu Asp Glu Thr He Glu Asp He Asp 1048 

ATG ATG GAC GAC ATC GGC ATA GAC TCT TCA GAC CTG GTG GAA GAC AGC TTC 33 92 
Met Met Asp Asp He Gly He Asp Ser Ser Asp Leu Val Glu Asp Ser Phe 1065 

CTG TAACTGGCGGATTCGAGGGGTTCCTTCCACTTCTGGGGCCACCTCTGGATCCCGTTCAGAAAA 345B 
Leu 1066 

CCACTTTATTGCAATGCGGAGGTTGAGAGGAGGACrTGGTTGATGTTTAAAGAGAAGTTCCCAGCCA 3525 
AGGGCCTCGGGGAGCCTTTCTAAATATGAAT^ 3592 
CCTCTTGCAATGCCTCAGTAGCATCTCAGTGGTGTGTGAAGTTTGGAGATAGATGGATAAGGGT^ATA 3 659 
ATAGGCCACAGAAGGTGAACTTTCTGCTTCAAGGACATTGGTGAGAGTCCAACAGACACAATTTATA 3726 
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Table 2, page 5 

AJ^CAArTAAOTCCCTCTGAAAT^^ J»J 
CTCAATCTAGATCOVTAAACCTCTrcCTGAACATAACTTCT^^ 

ATGA^^GCGCANAAAGACTGG ATTTG^GAAGTTNlUTlTITri 1 1 T ^^E^^PjE^3^^Qj!g JS? 



4798 
4865 



TAITTTTCTAAACTCCCTGGCTGTTCrc^^ 

JEsSssssS^^ till 

CATAAATTTAGGAACCGAAGTCCAAIcaCTGTAAA^ 



WCTACGATGAAGGGTGACATACCCCCTCTAA^ ZII,, 

GGTCGAGT^CAAAACCACCTTITITGTAAAACT "67 

SSSS^tt^ 5334 

acgaatSgactatggtatac™ 5401 

ICIACTIiroAAfcCCTTCTACGTCTTCGTXATTATTTCA 546B 

^Sc^TCAAAGAATAGGraAAAAAGTATCATTC^^ 5535 
TTTTTCTm^ACTGATCCAAGA^ 

TGTCGAGTTCGTAAGACAAATAGCGAGTGAGAGGGAACATGTCGGAM 5669 

AAACTATAAOSACACTCGGAACGTACTGTAGTACT 5"" 

CTCAGGATTGTTTACGAGGGTGGACTrAAACATATACTGACGTAAACAC^ "03 
CGTnAAGGTCTAAACAAAGGAAAACCGGAGGACGTTTCAGAGGTCTTCTTTrA^ 

gatgaaagataaaaatactactgttagtttcggccm f 0 3 J 

TAATCACTACAGGAATTTTACACCAGACGGTTAGACA^ ™°« 
CTATTCTATTTTAC^CAATATGTA^ 

ATGAATGTACAAGGGTTTTGTTACCACACCACWACACACTCTTOT "7Z 

TOATGTTOCATAATGCTTACGGGGA^ "39 
TATAAAGTTATTTACTATATATTAAATTTCCTTAAG 
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A polypeptide or nucleic acid is substantially pure, 
or substantially purified, when it comprises at least about 30% 
of the respective polymer in a composition, typically at least 
about 50%, more typically at. least about 70%, usually at least 
5 about 80%, more usually at least about 90%, preferably at least 
about 95%, and more preferably about 98% or more . 

The soluble fragments of the extracellular region 
will generally be less than about 400 amino acids, usually less 
than about 350 amino acids, more usually less than about 300 
10 amino acids, typically less than about 200 amino acids, and 
preferably less than about 150 amino acids. 

A. D Domains 

Based on a number of observations, the extracellular 

15 region (XR) of these PDGF receptor polypeptides comprises 5 
immunoglobulin-like domains. First, the amino acid sequence 
contains 5 segments characteristic of Ig-like domain 
structures, each of the segments having an appropriate size for 
an immunoglobulin domain. Each segment, except for the fourth, 

20 has characteristically spaced cysteine residues that are a 
diagnostic feature of an immunoglobulin-like domain. The 
receptor polypeptide sequence displays other features of 
immunoglobulin-like domain structure, e.g., the presence of 
characteristically positioned tryptophan and tyrosine residues. 

25 Direct sequence comparisons of segments of the receptor 

polypeptides with corresponding segments of true immunoglobulin 
domains shows a statistically significant similarity between 
PDGF receptor polypeptide domains and immunoglobulin domains . 
See, e.g., Williams (1989) Science 243: 1564-1570. The 

30 argument that the receptor polypeptide domains assume the 

folding pattern of immunoglobulin domains can be strengthened 
by examining the predicted secondary structure of the receptor 
polypeptides. 

When a homology mapping analysis is performed, the 
35 PDGF receptor polypeptide shows five Ig-like domains in the 
extracellular region, each domain showing statistically 
significant homology, to defined Ig-like domains. See, e.g., 
Williams and Barclay (1988) Arm. Rev. Immunol". Biochem. 6: 381- 
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405. Regions of homology will show significant sequence 
homology to particular Ig-like domains, and exhibit particular 
secondary and tertiary structural motifs characteristic of Ig- 
like domains. The domain structures will preferably be those 
5 segments with boundaries which approximately match the 
boundaries of the domain structures. The boundaries will 
preferably match within about 9 amino acids, typically within 
about 7 amino acids, more typically within about 5 amino acids , 
usually within about 3 amino acids, and more usually within 1 

10 amino acid. See, e.g., Cantor and Schimmel (1980) Biophysical , 
Chemistry , Vols I-III, Freeman and Co., San Francisco? 
Creighton (1984) Proteins; Structu re and Molecular Properties, . 
Freeman and Co., New York; and Watson et al. (1987) The 
Molecular Biology of the Gene . Vols 1 and 2, Benjamin, Menlo 

15 Park, California; each of which is hereby incorporated herein 
by reference. 

The sequences of the human type B and the human type 
A receptor polypeptides can be analyzed to predict their beta 
strand topology. Combining a Fourier analysis of hydrophobic 

20 sequence pattern and a Garnier-Robson algorithm, see, e.g., 
Gamier et al. (1978) J. Mol. Biol. 120: 97, with a turn 
predictor program, as reported in Cohen et al. (1986) 
Biochemistry 25: 266, produces a characteristic structural 
pattern. This pattern exhibits consensus 0-strand segments in 

25 each domain when analysed as described. 

. The first two Ig-like domains of the PDGF receptor 
polypeptides, Dl and D2, have about seven P -strand segments, 
designated the A, B, C, D, E, F, and G segments, as listed from 
amino proximal to carboxy proximal direction. The third, 

30 fourth and fifth Ig-like domains, D3, D4 and D5, are long 
enough to include an extra 0-strand segment, designated C. 
The fifth domain, D5, most closely resembles a variable heavy 
. chain domain in length. The type B receptor polypeptide D5 
further comprises an additional 0-strand segment designated C". 

35 These features and designations are based partly on the 

homology of segments between domains and segments in the type B 
and type A hPDGF-R polypeptides, and with the mouse type B PDGF 
receptor polypeptide, and also based upon homology to other Ig- 
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like segments found on other proteins, particularly other 
growth factor receptor proteins. The csf-1 receptor and c-kit 
proto-oncogene have similar Ig-like domain" organizations. See, 
e.g., Williams (1989) Science 243:1564-1570. 
5 The domain structure is based, in part, upon features 

common to Ig-like domains found in other proteins, including 
related receptors. See, e.g. , Ullrich and Schlessinger (1990) 
Cell 61:203-212; and Yarden and Ullrich (1988) Ann. Rev. 
Biochem. 57:443-78. The domain boundaries for the two alleles 
10 disclosed herein are identified below, but different alleles 
may have slightly different positions for the boundaries. See 
Table 14. 

The Ig-like domains (D domains) are characterized by 
the regularity of spacing of cysteine residues in the 

15 extracellular region. These five D domains, each about 100 

amino acids in length, have £-sheet rich structures, resembling 
immunoglobulin variable or constant regions. See, Williams 
(1989) Science 243:1964-1570. The natural XR domains are 
numbered from the amino proximal domain Dl, in order, through 

20 D5, at the carboxy proximal end of the XR. 

The exon structure of the mouse type B PDGF receptor 
polypeptide gene also matches this domain structure with 
reasonable fidelity. The correlation between the intron-exon 
structure and functional units further supports the hypothesis 

25 that the boundaries define functional units of the polypeptide. 
See, e^g. , Williams and Barclay (1988) Ann . Rev . Immunol . 
Biochem. 6 : 381-405 . The boundaries for each of these segments 
are indicated below for the two alleles disclosed herein, and 
similar boundaries will be found in other alleles at locations 

30 of sequence and functional homology. 

The amino-proximal Ig-like domain of the humaa 
platelet-derived growth factor receptor polypeptides is 
designated Dl. The Dl domain extends from about leu(l) to 
pro (91) in the type B receptor polypeptide, and from about 

35 gln(l) to pro (101) in the type A receptor polypeptide. See 
Table 14. The Dl domain apparently has about seven 0-sheet 
segments . 
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The next Ig-like domain, in the carboxy proximal 
direction of natural human platelet-derived growth factor 
receptor polypeptides, is designated D2. The D2 domain extends 
from about thr(92) to ser(181) in the type B receptor 

5 polypeptide, and from about asp (102) to ser(189) in the type A 
receptor polypeptide. The D2 domain apparently also has about 
seven 0-sheet strands designated A, B, C, D, E, F, and G. 

The third Ig-like domain found on natural human PDGF 
receptor polypeptides is designated D3. The D3 domain extends 
10 from about ile(182) to gly(282) in the type B receptor 

polypeptide, and from about glu(190) to gly(290) in the type A 
receptor polypeptide. The D3 domain apparently has about eight 
P -sheet strands designated A, B, C, C 1 , D, E, F, and G. 

The fourth Ig-like domain found in the natural human 

15 PDGF receptor polypeptides is designated D4. The D4 domain 

extends from about tyr(283) to pro (384) in the type B receptor 
polypeptide, and from about phe(291) to pro (391) in the type A 
receptor polypeptide. The D4 domain apparently has about eight 
0-sheet strands. Note that the D4 domains lack the 

20 characteristic cysteine residues, which correspond to val(306) 
and met (364) in the type B sequence shown, and to val(313) and 
ile(371) in the type A sequence shown. 

The fifth Ig-like domain is designated D5. The D5 
domain extends from about val(385) to lys(499) in the type B 

25 receptor polypeptide, and from about ser(392) to glu(501) in 

the type A receptor polypeptide. The D5 of the type B receptor 
polypeptide has about nine putative 0-sheet strand segments 
designated A, B, C, C", C M , D, E, F, and G, while the type A 
receptor polypeptide has only about eight 0-strand segments, 

30 lacking a C" segment. 

. The approximate boundaries of the domains and 0- 
strand segments are listed in Table 14. The apparent 
alignments of the segments are illustrated in Tables 4 and 5. 
Other alleles of the receptor polypeptides may also be analyzed 

35 by either homology or the structural analysis as described 
above. 
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The prototypical Dl domains are those sequences of 
the human type B receptor polypeptide and the human type A 
receptor polypeptide, as described. However, compatible amino 
acid substitutions, insertions, and deletions which preserve 
the desired ligand binding functions can be made. The function 
will usually be preserved by retaining the LBR segments in the 
correct orientation by use of appropriate structured segments. 
Conservative substitutions typically include substitutions 
within the following groups: glycine, alanine; valine, 
isoleucine, leucine; aspartic acid, glutamic acid; asparagine, 
glutamine; serine, threonine; lysine, arginine? and 
phenylalanine, tyrosine. Substitution or exchange of 0-sheet 
segments or sequences intermediate the segments from different 
domains may be performed, including between type B and A 
receptor polypeptides, or between different domains of another 
related receptor polypeptide. Segments outside the 
prototypical cysteines within ^-segments B and F (but val(306) 
and met(364) in the type B D4, and val(313) and ile(371) in the 
type A D4) will be usually less critical than the sequences 
between those residues, e.g., the C, C, C», D; and E /?-strand 
segments. Also, segments homologous to these disclosed 
segments may be substituted, including those with compatible 
amino acid substitutions, insertions, and deletions. Sources 
of similar domains and segments include related receptor 
polypeptides from human or other mammalian species. Non- 
mammalian receptor polypeptides may also exhibit significant 
homology and serve as sources for similar segments. Other Ig- 
like domains and segments may also be substituted. 

The present invention embraces polypeptides which 
exhibit homology to the disclosed and described segments and 
domains. It embraces segments comprising contiguous amino 
acids of the sequences disclosed, . typically at least about 8 
contiguous amino acids, more typically at least about 11 
contiguous amino acids, usually at least about 14 contiguous 
amino acids, more usually at least about 17 contiguous amino 
acids, and preferably at least about 21 or more contiguous 
amino acids. Constructs retaining the LBR segments are most 
valuable. The invention also includes modifications of those 




WO 92/13867 PCT/US92/00730 

33 

sequences, including insertions, deletions, and substitutions 
with other amino acids. Glycosylation modifications, either 
changed, increased amounts, or decreased amounts, as well as 
other sequence modifications are envisioned. Thus, the 
5 modified proteins comprising these amino acid sequences, e.g., 
analogues, will usually be substantially equivalent to these 
proteins in either function or structure. 

The 0-sheet strands may be slightly enlarged or 
shortened by respective insertions or deletions in the 

10 polypeptide sequence. Thus, certain embodiments will have a 
slightly enlarged or shortened particular domain by adding or 
deleting particular sequences of 0-sheet strands or their 
inter-strand sequences. Segments may be inserted or deleted 
which conform to the structural requirements of retaining the 

15 proper intra- and inter-domain interactions. In particular, 

changes which interrupt the secondary and tertiary structure of 
the protein will be disfavored. See, e.g., Cantor and Schimmel 
(1990) and Creighton (1984). In addition, amino acids or 
segments may be inserted or deleted in the regions outside of 

20 the 0-sheet strands and between domains. Typically the 

substitutions will be of amino acids having similar properties, 
and additions or deletions would preferably be selected among 
those which retain receptor biological functions, e.g., ligand 
binding. 

25 The sequence of a 0-sheet segment will typically not 

differ from a sequence from a human type B polypeptide or a 
human type A polypeptide by greater than about 50%, more 
typically less than about 39%, usually less than about 29%, and 
more usually less than . about 20%. Comparable similarities over 

30 each of the non-0-sheet strands of each domain will be 
preferred. 

The boundaries between domains are defined, in part, 
by. the definitions for domains in the Ig-like domains. 
Examples of similar domains are found in immunoglobulin and - 
35 growth factor receptor polypeptides. The domain boundaries 
between bl and D2; D2 and D3 ; D3 and D4; and D4 and D5 
correspond approximately to exon locations, further supporting 
the proposal that the domain structures correspond to 
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evolutionary and funcrtional units. See, e.g., Watson et al. 
(1987) The Molecular Biolo gy of the Gene, vols. 1 and 2, 
Benjamin, Menlo Park, California. 

The D2 domains have similar characteristics to the Dl 
5 domains, as shown by the alignments illustrated in Tables 4 and 
• 5. Both domains have 0-sheet segments designated A, B, C, D, 
E, F, and G. The domain 3 segments, or D3, also exhibit 
homology, but have an additional 0-strand segment designated 
C*. The D4 segments, or D4, have non-cysteine residues at ttie 
10 positions which typically correspond to cysteines in the other 
domains. In the type B allele shown, the residues are val(306) 
and met(364), while in the type A allele shown, the residues 
are val(313) and ile(371) . The D4 domains also have 0-strand 
segments designated c*. The domain 5, or D5, have the 
15 consensus cysteine residues and the additional C £-strand 

segments, and the type B receptor polypeptide has an additional 
C n £-strand segment. 

The present invention provides for various constructs 
comprising ligand binding constructs, typically comprising 
20 substantially intact domains. These constructs will have 
various uses, e.g., for binding ligands, or substituting for 
intact receptor polypeptides. For example, each of the 
separate domains may comprise a separate polypeptide alone, or 
may be fused to another peptide, such as the TH and IR regions 
25 of a receptor polypeptide, e.g., hPDGF-R. See,, e.g., Table 6. 
These individual single domain polypeptides will exhibit 
specific activity associated with thesfe specific domains, 
preferably as an agonist or antagonist for ligand binding, 
preferably with characteristics shared with the intact receptor 
30 polypeptide or XR. The domains may also preferably serve as 
competitive inhibitors of PDGF-R polypeptides, competing with 
natural PDGF-receptors to bind ligands. The present invention 
also provides repetitive sequences of a single domain. For 
example, a Dl domain by itself is provided, a Dl-Dl dimer in a 
35 single polypeptide is provided, a D1-D1-D1 triplet repeat is 
also provided. Likewise up to a large number of Dl domains 
which will exhibit many functions, e.g., immunological 
. properties, characteristic of various natural PDGF-R sequences. 
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Similar constructs of each of D2, D3, D4, and D5 are provided, 
along with combinations > See Tables 6, 7, 8, 9 and 10- These 
will often be soluble fragments of the XR, or may be fused to 
other polypeptides, including a PDGF-R TM segment, preferably 
5 with an IR segment also. 
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TABLE 6 

XR domain structure of single domain forms 

5 

Dl ®2 - D3 D4 D5 

10 TABLE 7 

XR domain structure of two domain forms 

Dl-Dl D2-D1 D3-D1 D4-D1 D5-D1 

15 D1-D2 D2-D2 D3-D2 D4-D2 D5-D2 

D1-D3 D2-D3 D3-D3 D4-D3 D5-D3 

D1-D4 D2-D4 D3-D4 D4-D4 D5-D4 

D1-D5 D2-D5 D3-D5 D4-D5 D5-D5 



20 
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TABLE 8 

XR domain structure of three domain forms 

Dl-W D2-W D3-W D4-W D5-W 

where W is each of the 25 possible combinations listed in 
TABLE 2, giving a total of 125 elements in this table 



TABLE 9 

35 XR domain structure of four domain forms 

Dl-X D2-X D3-X D4-X D5-X 

where X is each. of the 125 possible combinations 
40 listed in TABLE 5 , giving a total of 625 elements in 

this table 



TABLE 10 

XR domain structure of five domain forms 

Dl-Y D2-Y D3-Y D4-Y D5-Y 

50 where Y is each of the 625 possible combinations 

listed in TABLE 6, but not including the combination 
D1-D2-D3-D4-D5, giving a total of 3124 elements in 
this table 
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In addition, the present invention provides similar 
structures with spacer regions between the domain structures . 
In particular, the regions corresponding to the intra-cysteine 
residues of the domains shown in Tables 4 and 5 are useful, 
5 For example, a spacer polypeptide may be inserted between 
adjacent domains or do spaces between the important ligand 
binding segments, typically found within the intra-cysteine 
segments described, e.g., the B, C, C», C", D, E, and F 0- 
strand segments. Thus, for example, a polypeptide of the 
10 structure D1-X1-D2 is provided where XI is a spacer segment 
which is not a D domain. The order of the domains may be 
reversed, and the invention also provides polypeptides such as 
D2-D1, or D2-X1-D1. In particular, the non-D domain character 
of XI is provided to avoid the peptide D1-X1-D3 from 
15 describing,, or encompassing, D1-D2-D3. 

Another particularly preferred embodiment of the 
invention is a polypeptide having the described extracellular 
region domain structure combined with other segments of a human 
platelet -derived growth factor receptor, particularly the 
20 transmembrane segment (TM) and the intracellular region (IR) . 

Thus, the present invention provides for a receptor polypeptide 
which either has a modified order of the extracellular region 
domains in the amino to carboxy direction, e.g., a D5-D4-D3-D2- 
Dl-TM-IR polypeptide, or, in some cases reversal of various 
25 domains. It also provides for a receptor polypeptide with a 
deleted intact domain and for a receptor polypeptide having an 
additional domain added to it. Examples include D1-D2-D3-TM- 
IR, or D1-D2-D3-D4-TM-IR. In particular, fusions with the XR 
segments described in Tables 6, 7, 8, 9, and 10 are preferred 
30 embodiments. 

The modified combinations of the D domains are 
expected to both simulate and differ from the natural receptor. 
The modified polypeptide would be expected, in some 
embodiments, to exhibit a modified binding, affinity, e.g., 
35 higher or lower affinity, or to exhibit a different spectrum of 
binding to different ligands or ligand analogues. They may 
also have an altered ligand binding transducing efficiency, or 
a modified inter-chain, association affinity. 
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The present invention provides the means for 
determining the minimal structural features necessary to 
perform various functions of the extracellular region of 
platelet-derived growth factor receptors, preferably human 
5 receptors. Although similar determinations may be performed in 
mouse or other mammalian species, the human receptor will 
typically be preferred for diagnostic or therapeutic purposes. 

To determine the minimal region necessary for a 
functional activity, e.g., ligand binding, an assay for that 
10 activity is developed. The main receptor functions, as 
indicated above, include ligand binding, tyrosine kinase 
activity, and receptor dimerization. Simple and quick assays 
for each of these molecular functions may be developed. Ligand 
binding assays are described, e.g., in Gronwald et al. (1988) 
15 Prod, Nat'? ^d. Sci. USA 85:3435-3439; Heldin et al. (1988) 
EMBO J. 7:1387-1393? and Escobedo et al. (1988) Science 
240:1532-1534. Receptor dimerization assays are described, 
e.g., in Yarden and Schlessinger (1987) Biochemistry 26:1434- 
1442 and 1443-1451. 
20 As an alternative means for determining sites which 

interact with specif ic other proteins, physical structure 
determination, e.g., x-ray crystallography or 2 dimensional NMR 
techniques, will provide guidance as to which amino acid 
residues form the molecular contact regions. For a detailed 
25 description of protein structural determination, see,, e.g., 

Blundell and Johnson (1976) Protein Crystal lography, Academic 
Press, New York, which is hereby incorporated herein by 
reference. 

Ligand binding assays may include binding of labeled 
30 ligand or competition assays for binding. Signal transduction 
may be indirectly assayed by measuring an activity modulated by 
ligand binding, e.g., tyrosine kinase activity, or some measure 
of a conformational or other change in receptor structure. For 
example, an antibody or other binding protein which 
35 specifically binds or dissociated from the receptor polypeptide 
upon ligand binding may be used. Receptor dimerization may be 
measured by a proximity assay, including a fluorescence 
quenching or other spectroscopic measurement. Various 
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proximity assays are known, see f e.g., Ullrich and Schlessinger 
(1990) Cell 61:203-212; Yarden and Schlessinger (1987) 
Biochemistry 26:1434-1942 and 1443-1451; each of which is 
hereby incorporated herein by reference. 
5 Once an assay has been developed, various 

combinations of domain or other segments, e.g., LBR's, can be 
tested for affecting that activity. A competitive inhibition 
assay will detect those constructs which can bind the ligand. 
The first domain structures to try will "ordinarily be the 

10 individual domains, either alone or linked to chimeric proteins 
or the TM-IR segment of the receptor ♦ Various alleles, 
modifications to the individual domains, or related chimeric 
domains would be tested. Both deletion and chimeric proteins 
will be constructed. 

i5 Various combinations of each domain will be 

constructed and tested to select those which affect the 
measured activity. Repeats of those domains should be tested, 
e.g, Dl-Dl. If no single domain does affect the function, 
then various 2 domain constructs, in order, would be tried, 

20 e.g. , D1-D2 -TM-IR, D2-D3-TM-IR, D3-D4 -TM-IR, and D4-D5-TM-IR. 
Selected combinations listed in Tables 6, 7, 8 r 9, and 10 will 
be constructed and tested. 

In order to produce soluble forms, it will often be 
desireable to attach appropriate amino terminal segments , some 

25 of which would- be expected to be present in the Dl domain or in 
the precursor form. Correct secretion and processing may be 
dependent upon various amino proximal features, such as signal 
sequences , and other features essential for correct targeting 
and processing. See, e.g., Watson et al. (1987) The Molecular 

30 Biology of the Gene P vols* 1 and 2, Benjamin, Menlo Park, 
California. 

When correct domains have been selected which are 
especially effective in modulating or competing defined 
functions, a more detailed analysis, to the level of the j8- 
35 strand segments might be addressed. Various chimeric, 

deletion, insertion, or substitution constructs of each 0- 
strand or inter-strand segment may be generated and tested, as 
described above. Each construct could be produced using 
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methods of standard genetic engineering, especially using 
synthetic primers. Procedures for using such reagents are 
described, e.g.. in Sambrook, et al. (1989) Modular C ionipq; 
h T^nre+.nrv Manual , vols . 1-3 . Cold Spring Harbor Press , and 
Ausubel et al. (eds.) (1989) Current PrornrolB in Molecular 
BiPlfigy., Wiley, each, of which is hereby incorporated herein by 
reference. 

B. Soluble Forms 

In some embodiments, only the extracellular region is 
provided. Thus, the extracellular region alone, without the 
transmembrane segment, will often be a soluble polypeptide. It 
has been demonstrated that the entire extracellular region, 
separated from, and which lacks a transmembrane region and an 
intracellular region, still serves as a ligand binding 
polypeptide. In particular, the soluble polypeptide D1-D2-D3- 
D4-D5 has been demonstrated to bind various PDGF forms. 
Although the binding specificity for the PDGF form is 
dependent, to some extent, on the specific domains included, 
modifications to the specificity of the ligand binding may be 
effected by either substituting various different domains or 
rearranging the domains. Substitution with other homologous 
segments may also be performed, e.g., substituting an Ig-like 
domain from an antibody molecule, such as an antibody which 
binds a platelet-derived growth factor. Alternatively, a 
domain from a different related growth . factor or ligand 
receptor may be substituted, e.g., from an FGF receptor or 
another PDGF receptor. The order of the domains may also be 
modified, e.g., D5-D4-D3-D2-D1. 

In particular, the activities which will usually be 
of greatest importance with the extracellular constructs relate 
to the binding of the ligand. For example, it has been 
discovered that domains D4 and D5 are not essential for ligand 
binding of a soluble extracellular region PDGF-R ■ polypeptide . 
Of the remaining domains, if domain D3 is separated from 
domains Dl and D2, the construct D1-D2 binds the ligand only at 
low affinity, but a D1-D2-D3 construct binds ligand at high 
affinity. 
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A typical hPDGF-R nucleic acid sequence encodes a 
transitory amino terminal hydrophobic sequence, which is 
usually cleaved during the membrane translocation process. The 
classical function of a signal sequence is to direct the 
5 nascent polypeptide chain to membrane bound ribosomes, thereby 
leading to membrane translocation or cellular targeting. 
However, since the signal sequence is typically removed in the 
translocation process, the signal sequence is usually absent in 
a mature polypeptide* Often a signal sequence will be attached 
10 upstream of a desired soluble peptide of this invention. 

Solubility of a polypeptide depends upon the 
environment and the polypeptide. Many parameters affect 
polypeptide solubility, including the temperature, the 
electrolyte environment, the size and molecular characteristics 
15 of the polypeptide, and the- nature of the solvent. Typically, 
the temperature at which the polypeptide is used ranges from 
about 4'C to about 65*C. Usually the temperature at use is 
greater than about 18 *C and more usually greater than about 
22 *C. For diagnostic purposes, the temperature will usually be 
20 about room temperature or warmer, but less than the 

denaturation temperature of components in the assay. For 
therapeutic purposes, the temperature will usually be body 
temperature, typically about 37 *C for humans, though under 
certain situations the temperature may be raised or lowered in 
25 situ or in vitro. 

The electrolytes will usually approximate in situ 
physiological conditions, but may be modified to higher or 
lower ionic strength where advantageous. The actual ions may 
be modified to conform to standard buffers used in 
30 physiological or analytical contexts. 

The size and structure of the polypeptide should be 
in a substantially stable and globular state, and usually not 
in a denatured state. The polypeptide may be associated wi*th 
other . polypeptides in a quaternary structure, e.g., to confer 
35 solubility. 

The solvent will usually be a biologically compatible 
buffer, of a type used for preservation of biological 
activities, and will usually approximate a physiological 
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solvent, on some occasions, a detergent will be added, 
typically a mild non-denaturing one* 

Solubility is usually measured in Svedberg units, 
which are a measure of the sedimentation velocity of a molecule 
under particular conditions. The determination of the 
sedimentation velocity was classically performed in an 
analytical ultracentrifuge, but is typically now performed in a 
standard ultracentrifuge. See, Freifelder (1982) Physical 
biochemistry (2d ed.), W.H. Freeman, and Cantor and Schimmel 
(1980) Biophysical Chemistry , parts 1-3, W.H. Freeman & Co., 
San Francisco, each of which is hereby incorporated herein by 
reference. As a crude determination, a sample containing a. 
"soluble" polypeptide is spun in a standard full sized 
ultracentrifuge at about 5 OK rpm for about 10 minutes, and 
soluble molecules will remain in the supernatant. A soluble 
particle or polypeptide will typically be less than about 30S, 
more typically less than about 15S, usually less than about 
10S, more usually less than about 6S, and, in particular 
embodiments, preferably less than about 4S, and more preferably 

less than about 3S. 

This invention provides platelet-derived growth 
factor polypeptides and proteins having platelet-derived growth 
factor receptor ligand binding activity. The receptors of the 
present invention include PDGF. receptor amino acid sequences 
such as those shown in Tables 6, 7, 8, 9, and 10. Also 
provided are homologous sequences, allelic variations, induced 
mutants, alternatively expressed variants, and proteins, encoded 
by DNA which hybridize under high stringency conditions to PDGF 
receptor encoding nucleic acids retrieved from naturally 
occurring material. 

The platelet-derived growth factor receptor peptides 
of the present invention will exhibit at least about 80% 
homology with naturally occurring domains of hPDGF receptor 
sequences in the domains Dl, D2, D3, D4, and D5, typically at 
least about 85% homology with a natural form of a receptor 
sequence, more typically at least about 90% homology, usually 
at least about 95% homology, and more usually at least about 
97% homology. 
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Homology, for. polypeptides, is typically measured 
using sequence analysis software, see, e.g., Sequence Analysis 
Software Package of the Genetics Computer Group, University of 
Wisconsin Biotechnology Center, 1710 University Avenue, 
5 Madison, Wisconsin 53705. Protein analysis software matches 

similar sequences using measure of homology assigned to various . 
substitutions, deletions, substitutions, and other 
modifications* Similar, or homologous, substitutions for LBR 
segments will be made in known sequences, thereby producing new 
10 binding molecules having modified affinity or specificity of 
ligand binding. 

Various other software analysis programs can analyze 
the conformational structure of a polypeptide. Homologous 
conformation may also be achieved by appropriate insertion, 
15 deletion, substitution, or modification of amino acid 

sequences. Since the conformational structure of the domains 
and j9-strand segments is only partially understood, the present 
invention also encompasses various modifications to the 
sequences disclosed and retaining these structural features „ 
20 In particular, ligand binding function is believed to 

be localized to the extracellular domain, particularly the 
LBR's, and the soluble forms will preferably retain this 
particular function. Soluble fragments of PDGF receptors will 
be useful in substituting for or for interfering with, e.g., 
25 blocking, by competing for PDGF binding, the functions of the 
natural receptor both in vitro and in vivo . Alternatively, 
soluble forms may interfere with the dimerization of PDGF 
receptor polypeptides, since the proteins may normally be in, 
or function in, a dimer form. Receptor dimerization may be 
30 essential for proper physiological signal transduction, and 
introduction of fragments may function to interrupt these 
processes by blocking their dimerization. 

PDGF receptor polypeptides may be purified using 
techniques of classical protein chemistry, see, e.g., Deutscher 
35 (ed.) (1990) Guide to Purification : Methods in Enzymology, Vol. 
182, which is hereby incorporated herein by reference. 
Alternatively, a lectin affinity chromatography step may be 
used, or a highly specific ligand affinity chromatography 
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procedure, e.g., one that utilizes a PDGF conjugated to biotin 
through cysteine residues of the protein mitogen. Purified 
PDGF receptor polypeptides may also be obtained by a method 
such as PDGF affinity chromatography using activated CH- 
5 Sepharose coupled to PDGF through primary amino groups as 
described in Imamura et al. (1988) Biochem. Bioohvs. Res. 
Commun. 155:583-590. 

Depending on the availability of specific antibodies, 
specific PDGF receptor peptide constructs may also be purified 

10 using immuno-affinity chromatography. Antibodies prepared, as 
described below, may be immobilized to an. inert substance to 
generate a highly specific immuno-af f inity column. See, e.g.,. 

Harlow and Lane (1990) Monoclonal Antibodies: h Laboratory: 

Manual, Cold Spring Harbor Laboratory, which is hereby 

15 incorporated herein by reference. 

Various cells or tissues may be selected as starting 
materials, usually selected on the basis of abundant expression 
of the desired receptor construct or polypeptide. High 
expression promoter sequences may be operably linked to a 

20 recombinant sequence, preferably an inducible promoter. The 
promoter is operably linked when it operates to promote the 
sequence. Appropriate cells that contain relatively large 
amounts of the receptor protein, as determined by high affinity 
binding of PDGF, can be transformed with variants of the PDGF 

25 receptor polypeptides. These may be used to replace the 

natural form of PDGF receptor by a construct with a deletion or 
insertion. 

The ligand binding regions (-LBR's) or other segments 
may be "swapped" between different new fusion constructs or 

30 fragments. Thus, new chimeric polypeptides exhibiting new 

combinations of segments can result from the structural linkage 
of different functional domains. Ligand binding regions which 
confer desired or modified specificities may be combined with 
other domains which have another function, e.g., each Ig-like 

35 domain could be substituted by a similar domain from other 
related polypeptides, or LBR's between different alleles or 
similar receptors may be combined. 
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The present invention also provides for fusion 
polypeptides between the receptor polypeptide domains and other 
homologous or heterologous proteins. Homologous proteins may 
be fusions between similar but different growth factor 
5 receptors resulting in, e.g., a hybrid protein exhibiting 

ligand specificity of one receptor with an intracellular domain 
of another, or a receptor which may have altered affinity or a 
broadened or narrowed specificity of binding. Likewise, 
heterologous fusions, may be constructed which exhibit a 
10 combination of properties or activities of the derivative 
proteins. Typical examples are fusions of a reporter 
polypeptide, e.g., luciferase, with a domain of a receptor, 
e.g., a ligand binding domain from the extracellular region of 
a human platelet-derived growth factor receptor, so that the 
15 presence or location of a desired ligand may be easily 

determined. See, e.g.. Dull et al., U.S. Patent No. 4,859,609, 
which is hereby incorporated herein by reference. Other gene 
fusion partners include bacterial £-galactosidase, trpE, 
protein A, ^-lactamase, at-amylase, alcohol dehydrogenase, and 
20 yeast a-mating factor. See, e.g., Godowski et al., (1988) 
Science 241: 812-816. Additional sequences with various 
defined functions may be found by searching through the 
GenBank™ (National Institutes of Health) sequence data bank. A 
heterologous fusion protein is one which includes sequences not 
25 naturally found in conjunction with one another. Thus, a 

heterologous fusion protein may be a fusion of two similar* and 
homologous, sequences. 

Fusion proteins would typically be made by either 
recombinant nucleic acid methods with expression, or by 
30 synthetic polypeptide methods. Techniques for nucleic acid 

manipulation are described generally, for example, in Sambrook 
et al. (1989) Molecular Cloning: A Laboratory Manual (2nd ed. ) 
volumes 1-3, Cold Spring Harbor Laboratory, which is hereby 
incorporated herein by reference. Techniques for synthesis of 
35 polypeptides are described, for example in Merrifield (1963) J^. 
Amer. Chem. Soc. 85:2149-2456; Atherton et al. (1989) Solid 
Phase Peptide Synthesis: A Practical Approach , IRL Press, 
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Oxford; and Merrifield (1986) Science 232:341-347; each of 
which is hereby incorporated herein by reference. 

The recombinant nucleic acid sequences used to 
produce fusion proteins of the present invention may be derived 
5 from natural or synthetic sequences. Many natural gene 
sequences are available from various cDNA or from genomic 
libraries using appropriate probes, see, e.g., GenBank 1 *, 
National Institutes of Health". 

Typical probes for isolating platelet-derived growth 

10 factor receptor genes may be selected from sequences of Tables 
1 and 2, in accordance with standard procedures. Suitable 
synthetic DNA fragments may be prepared, e.g., by the 
phosphoramidite method described by Beaucage and Carruthers 
(1981) Tetra. Letts. 22:1859-1862. A double stranded fragment 

15 may then be obtained by either synthesizing the complementary 
strand and hybridizing the strands together under appropriate 
conditions or by adding the complementary strand using DNA 
polymerase with an appropriate primer sequence. 

20 III. Nucleic Acids 

The present invention provides nucleic acid sequences 
encoding various PDGF receptor sequences described above. 
Tables 1 and 2, respectively set forth the corresponding cDNA 
sequences encoding human type B and type A PDGF receptor 
25 polypeptides. 

Substantial homology in the nucleic acid context 
means either that the segments, or their complementary strands, 
when compared, are the same when properly aligned, with 
appropriate nucleotide insertions or deletions, in at least 
30 about 60% of the residues, typically at least about 70%, more 
typically at least about 80%, usually at least about 90%, and 
more usually at least about 95 to 98% of the nucleotides. 
Appropriate nucleotide insertions or deletions include 
interdomain sequences, or those external to the cysteines 
35 within a domain, but the sequences within the paired cysteines 
(or their equivalents in the D4 domains) will often be very 
important to retain. Structural homology will exist when there 
is at least about 55% homology over a stretch of at least about 
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14 nucleotides, typically at least about 65%, more typically at 
least about 75%, usually at least about 90%, and more usually 
at least about 95% or more. 

Alternatively, substantial homology exists when the 
5 segments will hybridize under selective hybridization 

conditions, to a strand, or its complement, typically using a 
sequence of at least about 20 contiguous nucleotides derived 
from Table 1 or 2. However, larger segments would usually be 
preferred, e.g., at least about 30 contiguous nucleotides, more 
10 usually at least about 40, and preferably more than about 50. 
Selectivity of hybridization exists when hybridization occurs 
which is more selective than total lack of specificity. See, 
Kanehisa (1984) Nucleic Acids Res* 12:203-213, which is 
incorporated herein by reference. 
15 Stringent hybridization conditions will normally 

include salt concentrations of less than about 1 M, typically 
less than about 7 00 mM, more typically less than about 500 inM, 
usually less than about 400 mM, more usually less than about 
300 mM, and preferably less than about 200 mM. Temperature 
20 conditions will typically be greater than about 20 •C, more 

typically greater than about 25 *C, usually greater than about 
30 'C, more usually greater than about 37 *C, and preferably in 
excess of about 40 # C, depending upon the particular 
application. As other factors may significantly affect the 
25 stringency of hybridization, including, among others, base 

composition and size of the complementary strands, presence of 
organic solvents, and extent of base mismatching, the 
combination of parameters is more important than the absolute 
measure of any one. 
30 Probes may be prepared based on the sequence of the 

PDGF receptor encoding sequences provided in Tables 1 and 2. 
The probes may be used to isolate other PDGF receptor nucleic 
acid sequences by standard methods. See, e.g., Sambrook et al. 
(19B9) Molecular Cloning: A Laboratory Manual , vols. 1-3, CSH 
35 Press, N.Y., which is hereby incorporated herein by reference. 
Other similar nucleic acids may be selected for by using 
homologous nucleic acids. Alternatively, nucleic acids 
encoding these same or similar receptor polypeptides may be 
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synthesized or selected by making use of the redundancy in the 
genetic code. Various, codon substitutions may be introduced, 
e.g., silent changes thereby providing various convenient 
restriction sites, or to optimize expression for a particular 
5 system r e.g., to match the optimum codon usage. Mutations may 
be introduced to modify the properties of the receptors, 
perhaps to change the ligand binding affinities, the inter- 
chain affinities, or the polypeptide degradation or turnover 
rate. 

10 The DNA compositions of this invention may be derived 

from genomic DNA or cDNA, prepared by synthesis or may be a 
hybrid of the various combinations. Recombinant nucleic acids 
comprising sequences otherwise not naturally occurring in 
continuity are also provided by this invention. An isolated 

15 DNA sequence includes any sequence that has been obtained by 
primer or hybridization reactions or subjected to treatment 
with restriction enzymes or the like. 

Synthetic oligonucleotides can be formulated by the 
triester method according to Matteucci et al. (1981) J . Am., 

20 Chem. Soc. 103:3185 or by. other methods such as commercial 

automated oligonucleotide synthesizers. Oligonucleotides can 
be labeled by excess polynucleotide kinase (e.g., about 10 
units to 0.1 nanomole substrate is used in connection with 50 
DM Tris,. pH 7.6, 5 mk dithiothreitol , 10 mM MgCl 2 , 1-2 mM ATP, 

25 1.7 pmoles 32 P-ATP (2.9 mCi/mmole) o.l mM spermidine, O.i mM 

EDTA) . . Probes may also be prepared by hick translation, KLenow 
fill-in reaction, or other methods known in the art. . See, 
e.g. , Sambrook et al. 

cDNA or genomic libraries of various types may be 

30 screened for new alleles or related sequences. The choice of 
cDNA iibraries normally corresponds to a tissue source which is 
abundant in mRNA for the desired receptors. Phage libraries 
are normally preferred, but plasmid libraries may also be used. 
Clones of a library are spread onto plates, transf erred to. a 

35 substrate for screening, denatured, and probed for the presence 
of desired sequences. 

For example, with a plaque hybridization procedure, 
. each plate containing bacteriophage plaques is replicated onto 



WO 92/13867 



PCT/US92/00730 



49 

duplicate nitrocellulose filter papers (Millipore-HATF) . The 
phage DNA is denatured with a buffer such as 500 mM NaOH, 1.5 M 
NaCl for about 1 minute, and neutralized with, e.g., 0.5 M 
Tris-HCl, pH 7.5, 1.5 M NaCl (3 times for 10 minutes each). 
The filters are then washed. After drying, the filters are 
typically baXed, e.g., for 2 hours at 80 *c in a vacuum oven. 
The duplicate filters are prehybridized at 42 *C for 4-24 hours 
with 10 ml per filter of DNA hybridization buffer (20-50% 
formamide, 5X SSC, pH 7.0, 5X Denhardt's solution 
(polyvinylpyrrolidone, plus Ficoll and bovine serum albumin? 
IX = 0.02% of each), 50 mM sodium phosphate buffer at pH 7.0, 
0.2% SDS, and 50 Mg/al denatured salmon sperm DNA) . 
Hybridization with an appropriate probe may be performed at 
42 'C for 16 hrs with 10 ml/filter of 1 x 10* cpm/ml of DNA 
hybridization buffer containing radioactively labeled probe. 
The final concentration of formamide is varied according to the 
length of the probe and the degree of stringency desired. See, 
e.g., Wetmur and Davidson (1968) J. Mol. Biol. 31:349-370; and 
M. Kanehisa (1984) Nuc. Acids Rg«. non^oi^ each of which is 
incorporated herein by reference, for a discussion of 
hybridization conditions and sequence homology. 

An oligonucleotide probe based on the disclosed amino 
acid sequences may be used to site specifically mutate or 
generate recombinant fusion or deletion constructs. See, e.g., 
Tables 11 and 12 for preferred oligonucleotide reagents. 
Procedures such as those described by Kimbel et al. (1987) 
tf etfrods jp Sn Z V7no T loqY : 3 67 , may t>e used. The sequences PA1 
through PA9 correspond to Seq. ID No. 6 through 14, 
respectively, and sequences PA101 through PA109 correspond to 
Seq. ID No. 15 through 23, respectively. 



WO 92/13867 

50 



PCT/US92/OO730 



PA1 



TABLE 11 

HDHAN B-type. PDGF-R MDTM35HESIS OUGOHERS 

nomain 5 / 3 •NonCoding 

5 . CCA CAC TCcfSTG CCC TTT AAG / . 3 



PA2 5 1 



TCC TTC GAC^CAG ATC AAT \ TAGCTTCCTGTAGGGGGCTG 



remain 3 / 3 •NonCoding 

M 5. ATC ACC GT^f^ WC GGC '/ TAGCTTCCTGTAGGGGGCTG 3« 



Domain 2 / 3' NonCoding 

PA4 5 . TAC AGA CTC CAG GTG TCA TCC / TAGCTTCCTGTAGGGGGCTG 3 ■ 

YRLQVSS/" 



rv™*™ l / 3»NonCoding 

PA5 5 . CTC TAC ATC^^G CCA GAT CCC TAGCTTCCTGTAGGGGGCTG 3- 

L Y IF VP DP / 



Q ' I S Q G:LV/I N v 



Signal Sequence : Domain 1 / Jjl? 11 ^ L r nTA AGT 3' 

PA9 5' CAG ATC TCT CAG GGC : CTG GTC / GTC CGA GTG CTG.GAG CTA AGT 
Q I S Q G : I* V /v k v 
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TABX£ 12 

PROPOSED HUMAN A-type PDCF-R KDTAGEHESIS OLIGOMERS 

Domain 5 / 3'NonCoding 

PA101 5" GCT CCC ACC CTG CGT TCT GAA / TAACTGGCGGATTCGAGGGG 
APTLRSE / * ********** 



Domain 4 / 3*NonCoding 

PA102 5' GAA CTG TTA ACT CAA GTT CCT / TAACTGGCGGATTCGAGGGG 
ELLT QVP/ * ********** 



Domain 3 / 3'NpnCoding 

PA103 5* ATT TCT GTC CAT GAG AAA GGT / TAACTGGCGGATTCGAGGGG 
I S "V H E K G / * ********** 



Domain 2 / 3'NonCoding 

PA104 5» TAT GCT TTA AAA GCA ACA TCA / TAACTGGCGGATTCGAGGGG 3' 
YALKATS/* ********** 



Domain 1 . / 3*NonCoding 

PA105 5» ATT TAC ATC TAT GTG CCA GAC CCA / TAACTGGCGGATTCGAGGGG 3' 
I y I Y V P D P/* ********** 



Signal Sequence : Domain 1 / Domain 2 

PA106 5 » AGC CTA ATC CTC TCC CAG CTT / GAT GTA GCC TTT GTA CCT CTA GGA 3' 
$ L I L C:Q i*/D V A F V P L G 



Signal Sequence : Domain 1 / Domain 3 

PA107 5 • AGC CTA ATC CTC TGC CAG CTT/ GAG CTG GAT CTA GAA ATG GAA GCT CTT 3 1 
S L I L CJQ L/E L D L E M E A L 



Signal Sequence : Domain 1 / Domain 4 

PA108 5 1 AGC CTA ATC CTC TGC CAG CTT / TTC ATT GAA ATC AAA CCC ACC TTC 3* 
S L I L C:Q L/F I E I K P T F 



Signal Sequence : Domain 1 / Domain 5 

PA109 5 1 AGC CTA ATC CTC TGC CAG CTT / TCA TCC ATT CTG GAC TTG GTC 3 1 
S L I L C:Q L/S S I L D L V 
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in accordance with this invention any isolated DNA 
sequence which encodes substantially a PDGF-R complete 
structural sequence can be used as a probe. Alternatively, any 
DNA sequence that encodes a PDGF-R hydrophobic signal sequence 
and its translational start site may be used. An isolated 
partial DMA sequence which substantially encodes intact domains 
exhibiting PDGF-R activity (e.g., ligand or PDGF-R binding) is 
also part of this invention. Preferred probes are cDNA clones 
of PDGF receptor polypeptides. 

The DNA sequences used in this invention will usually 
comprise intact domain structures, typically at least about 5 
codons (15 nucleotides) , more typically at least about 9 
codons, usually at least about 13 codons, more usually at least 
about 18 codons, preferably at least about 25 codons and more 
preferably at least about 35 codons. One or more introns may 
also be present. This number of nucleotides is usually about 
the minimal length required for a successful probe that would 
hybridize specifically with a PDGF receptor sequence. For 
example, epitopes characteristic of a PDGF-R may be encoded in 
short peptides. Usually the wild-type sequence will be 
employed, in some instances one or more mutations may be 
.introduced, such as deletions, substitutions, insertions, or 
inversions. These modifications may result in changes in the 
•amino acid sequence, provide silent mutations, modify a 
restriction site, or provide specific mutations. The genomic 
sequence will usually not exceed about 200 kb, more usually not 
exceed about 100 Kb, preferably not greater than about 0.5 kb. 

Portions of the DNA sequence having at least about 10 
nucleotides from a DNA sequence encoding an PDGF receptor 
peptide will typically be used, more typically at least about 
15 nucleotides, usually at least about 20 nucleotides, more 
usually at least about 25 nucleotides, and preferably at least 
about 30 nucleotides. The probes will typically be less than 
about 6 kb, usually fewer than about 3.0 kb, and preferably 
less than about 1 kb. The probes may also be used to determine 
whether mRNA encoding a specific PDGF-R is present in a cell, or 
different tissues. 




WO 92/13867 PCT/US92/00730 

The natural or synthetic DNA fragments coding for a 
desired platelet-derived growth factor receptor fragment will 
usually be incorporated into DNA constructs capable of 
introduction to and expression in an in. vitro cell culture* 
5 Often the DNA constructs will be suitable for replication in a 
unicellular host, such as yeast or bacteria, but may also be 
intended for introduction to, with and without integration 
within the genome, cultured mammalian, or plant or other 
eukaryotic cell lines. Human cells may be preferred hosts. 

10 Higher eukaryote host cells will often be preferred because 

their glycosylation and protein processing patterns more likely 
simulate human processing. DNA constructs prepared for 
introduction into bacteria or yeast will typically include a 
replication system recognized by the host, the intended DNA 

15 fragment encoding the desired receptor polypeptide construct, 
transcriptional and translational initiation regulatory 
sequences operably linked to the polypeptide encoding segment, 
and transcriptional and translational termination regulatory 
sequences operably linked to the polypeptide encoding segment. 

20 The transcriptional regulatory sequences will typically include 
a heterologous enhancer or promoter which is recognized by the 
host. The selection of an appropriate promoter will depend 
upon the host, but promoters such as the trp, lac, and phage 
promoters, tRNA promoters, and glycolytic enzyme promoters are 

25 known and available. See, e.g., Sambrook et al. (1989). 

Conveniently available expression vectors which include the 
replication system and transcriptional and translational 
regulatory sequences together with the insertion site for the 
platelet-derived growth factor receptor DNA sequence may be 

30 employed. Examples of workable combinations of cell lines and 
expression vectors are described, e.g., in Sambrook et al. 
(1989); see also, Metzger et al. (1988) Nature 334:31-36. 

Expression vectors for these cells can include 
expression control sequences, such as an origin of replication, 

35 a promoter, an enhancer and necessary processing information 
sites, e.g., ribosome-binding sites, RNA splice sites,, 
polyadenylation sites, and transcriptional terminator 
sequences. Preferably, the enhancers or promoters will be 
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those naturally associated with genes encoding the PDGF 
receptor polypeptides, although it will be understood that in 
many cases others will be equally or more appropriate . Other 
preferred expression control sequences are enhancers or ^ 
5 promoters derived from viruses, such as SV40, Adenovirus, 
Bovine Papilloma Virus, and the like. 

Similarly , preferred promoters are those found 
naturally in immunoglobulin-producing cells, see, e.g., U.S. 
Patent No. 4,663,281, which is incorporated herein by 

10 reference, but SV40, polyoma virus, cytomegalovirus (human or 
murine) and the LTR from various retroviruses, e.g., murine 
leukemia virus, murine or Rous sarcoma virus and HIV, may be 
utilized, as well as promoters endogenous to PDGF-R genes. 
See, Enhancers and Eukarvotic Gene Expression . (1983) Cold 

15 Spring Harbor Press, N.Y. , which is incorporated herein by 
reference. 

The vectors containing the DNA segments of interest, 
e.g., a PDGF receptor polypeptide gene or cDNA sequence, can be 
transferred into the host cell by well-known methods, which 

20 vary depending on the type of cellular host. For example, 
calcium chloride transfection is commonly utilized for 
prokaryotic cells, whereas calcium phosphate treatment may be 
used for other cellular hosts. See generally, Sambrook et al. 
(1989) Molecular Cloning: A Laboratory Manual (2d ed.) CSH 

25 Press, which is incorporated herein by reference. The term 
"transformed cell" is meant to also include the progeny of a 
transformed cell. 

As with the purified polypeptides, the nucleic acid - 
segments associated with the ligand-bihding segment, the 

30 extracellular domain, and the intracellular domain are 

particularly useful. These gene segments will be used as 
probes for screening for new genes exhibiting similar 
biological activities, though the controlling elements of these 
genes may also be of importance. 



35 
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IV. Methods for Making PDGF Receptor Polypeptide 
Constructs 

DNA sequences may also be used to express PDGF-R 
polypeptides* For example, a DNA sequence of from about 21 
5 nucleotides (encoding about 7 amino acids) to about 2.1 kb- 
(about 700 amino acids) may be used to express a polypeptide 
having a PDGF receptor specific activity, typically 
ligand-binding . In particular, constructs retaining the ligand 
binding regions will be useful, as these constructs will 

10 possess binding activity. 

In particular, various synthetic linkers and probes 
may be constructed to facilitate genetic engineering of the 
PDGF-R nucleic acid sequences. Polymerase chain reaction (PCR) 
techniques can be applied to producing large quantities of 

15 fragments or segments useful in the proper manipulation of the 
sequences encoding the constructs. See, e.g., Innis et al. 
(1990) PCR Protocols . Academic Press. Alternatively, nucleic 
acid synthesizers can produce sufficiently large quantities of 
fragments for hybridizing to any preselected sequence, e.g., 

20 from Table 1 or 2, or for manipulating the sequence to add or 
delete specific domains or. segments. Particularly important: 
segments will be the LBR's. 

Large quantities of the receptor proteins may be 
prepared by expressing the whole receptor or parts of the 

25 receptor contained in the expression vehicles in compatible 

hosts such as E. col i P - yeast f mammalian cells, insect cells, or 
frog oocytes. The expression vehicles may be introduced into 
the cells using methods, well known in the art such as calcium 
phosphate precipitation (discussed below) , lipofectin 

30 electroporation, or DEAE dextran transformation. 

Usually the mammalian cell hosts will be immortalized 
cell lines. To study the characteristics of a PDGF-R and its 
corresponding ligand, it will be useful to trans feet, or 
transform mammalian cells which lack or have low levels of a 

35 PDGF receptor. Preferably, a signal sequence can serve to 
direct the peptide to the cell membrane or for secretion. 
Cells lacking significant amounts of PDGF receptors include 
Chinese hamster ovary (CHO) cells, most epithelial cell lines, 
and various human tumor cell lines. 
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Transformed or transfected cells can be selected 
which incorporate a DMA sequence which encodes a receptor that 
is functionally equivalent to a wild-type receptor thereby 
conferring a PDGF-sensitive mitogenic response. Such cells 
will enable the analysis of the binding properties of various 
added PDGF receptor polypeptides. Transfected cells nay also 
be used to evaluate the effectiveness of a composition or drug 
as a PDGF antagonist or agonist. The level of receptor 
tyrosine kinase activity or the rate of nucleic acid synthesis 
can be determined by contacting transfected cells with drugs or 
. ligands and comparing the effects of various ligand analogues 
against the controls. Although the most common procaryote 
cells used as hosts are strains of E. coli, other prokaryotes 
such as subtilis or Pspudomonas may also be used. The 

5 DNA sequences of the present invention, including fragments or 
portions of the sequence encoding for receptor polypeptides 
comprising intact structural domains, a portion of the. 
receptor, or a polypeptide having an PDGF-R activity, can be 
used to prepare an expression vehicle or construct for a PDGF-R 
20 polypeptide or polypeptide having a PDGF-R activity. Usually 
the control sequence will be a eukaryotic promoter for 
expression in a mammalian cell. In some vehicles the 
receptor's own control sequences may also be used. A common 
prokaryotic plasmid vector for transforming E. coli is pBR322 
25 or its derivatives, e.g. the plasmid pkt279 (Clontech) , see 

Bolavar et al. (1977) Gene . 2:95. The prokaryotic vectors may 
also contain prokaryotic promoters for transcription 
initiation, optionally, with an operator. Examples of most 
commonly used prokaryotic promoters include the beta-lactamase 
30 (penicillinase) ; lactose (lac) promoter, see Cheng et al. 
(1977) Nature . 198:1056; tryptophan promoter (trp) , see 
Goeddell et al. (1980) und^e Acid Res... 8: 457); P^ promoter; 
and the N-gene ribosome binding site, see Shimatake et al. 
(1981) Nature . 292:128-; each of which is hereby incorporated 
35 herein by reference. 

Promoters used in conjunction with yeast can be 
promoters derived from the enolase gene, see Holland et al. 
(1981) .t_ Biol. Chem. . 256:1385 ; or the promoter for the 
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synthesis of glycolytic enzymes such as 3 -phosphogly cerate 
kinase, see Hitzeman et al. (1980) J. Biol. Chem. , 255:. 

Appropriate non-native mammalian promoters will 
include the early and late promoters from SV40, see Fiers et 
5 al. (1978) Nature . 273:113; or promoters derived from murine 
muloney leukemia virus, mouse mammary tumor virus, avian 
sarcoma viruses, adenovirus II, bovine papilloma virus, or 
polyoma. In addition, the construct may be joined to an 
amplifiable gene, e.g. dihydrofolate reductase (DHFR) so that 

10 multiple copies of the PDGF receptor gene may be made. See, 
e.g., Kaufman et al. (1985) Mol. and Cell. Biol. 5:1750-1759; 
and Levinson et al. EPO publication nos. 0117059 and 0117060, 
each of which is incorporated hereby by reference. 

Prokaryotes may be transformed by various methods, 

15 including using CaCl 2 , see Cohen (1972) Proc. Nat'l Acad. Sci. 
USA, 69:2110; or the RbCl method, see Maniatis et al. (1982) 
Molecular Cloning: A Laboratory Manual , Cold Spring Harbor 
Press. Yeast may be transformed, e.g., using, a method 
described by Van Solingen et al. (19771 J. Bacteriol. 130:946; 

20 or Hsiao et al. (1979) Proc. Nat'l Acad. Sci. USA 76:3829. 

With respect to eukaryotes, mammalian cells may be transfected 
using a calcium phosphate precipitation method, see, e.g., 
Graham and van der Kb (1978) Virology , 52:546; or by lipofectin 
(BRL) or retroviral infection, see, e.g., Gilboa (1983) 

25 Experimental Manipulation of Gene Expression . Chap. 9, Academic 
Press P. 175. The actual expression vectors containing 
appropriate sequences may be prepared according to standard 
techniques involving ligation and restriction enzymes. See 
e.g., Maniatis supra . Commercially available restriction 

30 enzymes for cleaving specific sites of DNA may be obtained from 
New England BioLabs, Beverly, Massachusetts. 

Particular cotransformations with other genes may be 
particularly useful. For example, it may be desired to co- 
express the nucleic acid with another processing enzyme. Such 

35 enzymes include signal peptidase, tertiary conformation 

conferring enzymes, or glycosylating enzymes. This expression 
method may provide processing functions which otherwise might 
be lacking in the expression host, e.g., mammalian-like 
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glycosylation in a prokaryote expression system- 
Alternatively, the host cell selected for expression may be 
chosen on the basis of the natural expression of those 

processing enzymes - 

cell clones are selected by using markers depending 
on the mode of the vector construction- The marker may be on 
the same or a different DNA molecule preferably the same DNA 
molecule- With mammalian cells the receptor gene itself may be 
the best marker. In prokaryotic hosts the transformant may be 
selected by resistance to ampicillin, tetracycline, or other 
antibiotics. Production of a particular product based on 
temperature sensitivity or compensation may serve as 
appropriate markers. Various methods may be used to harvest 
and purify the PDGF-R receptor protein or peptide fragment. 
15 The peptide may be isolated from a lysate of the host- The 
peptide may be isolated from the cell supernatant if the 
peptide is secreted- The PDGF-R peptide is then further 
purified as discussed above using HPI*C , electrophoresis , or 
affinity chromatography, e.g. , immuno-aff inity or ligand 
20 affinity. 

Another method which can be used to isolate cDNA 
clones of PDGF-R related species involves the use of the 
polymerase chain reaction (PCR) . See, e.g.., Saiki et- al. 
(1985) science 230:1350- In this approach two oligonucleotides 
25 corresponding to distinct regions of the PDGF-R sequence are 
synthesized and then used in the PCR reaction, typically to 
amplify receptor-related mRNA transcripts from an mRNA source. 
Annealing of the oligonucleotides and PCR reactions are 
performed under conditions of reduced stringency- The 

30 resulting amplified fragments are subcloned, and the resulting 
recombinant colonies are probed with 32 P-labeled full-length 
PDGF-R cDNA. Clones which hybridize under low but not high 
stringency conditions represent PDGF-R related mRNA 
transcripts- This approach can also be used to isolate variant 

35 PDGF-R cDNA species which arise as a. result of alternative 

splicing, see Frohman et al- (1988) Proc. Nat '3 Acad. Sci. USA , 
85:8998- 
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V. Antibodies 

Polyclonal and/or monoclonal antibodies to the 
various PDGF receptor constructs, receptor peptides, and 
peptide fragments may also be prepared. Peptide fragments may 
5 be prepared synthetically in a peptide synthesizer and coupled 
to a carrier molecule (i.e. , keyhole limpet hemocyanin) and 
injected into rabbits over several months. The rabbit sera is 
tested for immunoreactivity to the PDGF receptor protein or 
fragment. Monoclonal antibodies may be made by injecting mice 

10 with PDGF-R protein , PDGF— R polypeptides, or mouse cells 

expressing high levels of the cloned PDGF receptor on its cell 
surface* Monoclonal antibodies will be screened by ELI S A and 
tested for specific immunoreactivity with the PDGF receptor 
protein or polypeptides thereof. See, Harlow and Lane (1988) 

15 Antibodies; A Laboratory Manual .. -CSHarbor Press, which is 

hereby incorporated herein by reference. These antibodies will 
be useful in assays as well as pharmaceuticals. 

Once a sufficient quantity of the desired PDGF 
receptor polypeptide construct has been obtained, the protein 

20 may be used for various purposes. A typical use is the 
production of antibodies specific for binding to epitopes 
characteristic of these receptors. These antibodies may be 
either polyclonal or monoclonal and may be produced by in vitro 
or in vivo techniques , 

25 For production of polyclonal antibodies, an 

appropriate target immune system is selected, typically a mouse 
or rabbit. The substantially purified antigen is presented to 
the immune system in a fashion determined by methods 
appropriate for the animal and other parameters well known to 

30 immunologists . Typical sites for injection are in the 

footpads, intramuscularly, intraperitoneal ly, or intradermally . 
Of course, another species may be substituted for a mouse or 
rabbit, typically a mammal, but possibly a bird or other 
animal. 

35 An immunological resporfse is usually assayed with an 

immunoassay. Normally such immunoassays involve some 
purification of a source of antigen, for example, produced by 
the same cells and in the same fashion as the antigen was 
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produced. The immunoassay may be a radioimmunoassay, an 
enzyme-linked assay (ELISA) , a fluorescent assay, or any of 
many other choices, most of which are functionally equivalent 
but may exhibit particular advantages under specific 

5 conditions. 

Monoclonal antibodies with affinities of at least 
about 10 6 M" 1 preferably 10 s ' 10 1Q , or higher will be made by 
standard procedures as described, e.g., in Harlow and Lane, 
(1988) Antibodies; A Laboratory Manual . CSH Press; or Goding, 

10 (1986) Monoclonal Antibodies: Prin ciples and Practice (2d ed) 
Academic Press, New York, which are hereby incorporated herein 
by reference. Briefly, appropriate animals will be selected 
and the desired immunization protocol followed. After the 
appropriate period of time, the spleens of such animals are 

15 excised and individual spleen cells fused, typically, to 
immortalized myeloma cells under appropriate selection 
conditions. Thereafter the cells are clonally separated and 
the supernatants of each clone are tested for their production 
of an appropriate antibody specific for the desired region of 

20 the antigen. 

Other suitable techniques involve in vitro exposure 
of lymphocytes to the antigenic polypeptides or alternatively 
to selection of libraries of antibodies in phage or similar 
vectors. See . Huse et al. "Generation of a Large Combinatorial 

25 Library of the Immunoglobulin Repertoire in Phage Lambda," 
Science 246:1275-1281 (1989), hereby incorporated herein by 
reference. The polypeptides and antibodies of the present 
invention may be used with or without modification. 
Frequently, the polypeptides and antibodies will be labeled by 
. 30 joining, either covalently or non-covalently, a substance which 
provides for a detectable signal. A wide variety of labels and 
conjugation techniques are known and are reported extensively 
in both the scientific and patent literature. Suitable labels 
include radionuclides, enzymes, substrates, cof actors, 

35 inhibitors, fluorescens, chemiluminescers , magnetic particles 

and the like. Patents, teaching the use of such labels include 
U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 
4,277,437; 4,275,149; and 4,366,241. Also, recombinant 
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immunoglobulins may be produced, see Cabilly, U.S. Patent 
No. 4,816,567. 

Antibodies of particular interest are those raised 
against the ligand binding regions. These will include some 
5 antibodies which function as ligands. Or, antibodies may be 
used to select for compounds which could serve as ligands for 
modified receptors. See, e.g., Meyer (1990) Nature 347:424- 
425? and Pain et al. (1990) Nature 347:444-447; each of which 
is hereby incorporated herein by reference. 

10 

VIII. Methods for Use 

The present invention provides platelet-derived 
growth factor receptor (PDGF-R) polypeptide purification 
methods as.well as methods for synthesizing PDGF receptors 

15 within cells. Also provided are homogeneous receptors produced 
by these methods, nucleic acid sequences encoding the receptors 
or portions of the receptors, as well as expression vehicles 
containing these sequences, cells comprising the PDGF- 
receptors, and antibodies to the' receptors. In particular, the 

20 present invention provides methods for assaying binding and 
other activities of receptor- like proteins having rearranged 
combinations of the domains. 

The extracellular region of the human type B PDGF 
receptor protein has been used to successfully bind PDGF BB 

25 ligand in a receptor activation assay. PDGF BB ligand binding 
to NIH3T3 cell-associated PDGF receptors is measured. Ligand 
binding causes phosphorylation (activation) of the cell 
associated receptors. Receptor phosphorylation is followed in 
a multi-step process which first involves solubilization of 

30 NIH3T3 cells and separation of cell proteins by electrophoresis 
of cell extracts on sodium dodecyl sulfate polyacrylamide gels. . 
Gels are blotted onto nitrocellulose and treated with anti- 
phosphotyrosine monoclonal antibodies to aid in the detection 
of phosphorylated PDGF receptor. Monoclonal antibodies are 

35 visualized through autoradiography of antibody-associated 125-1 
protein A which has been introduced at the terminal stage of 
the assay. 
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If human type B receptor protein (at about a 60 fold 
molar excess to PDGF BB ligand) is preincubated with ligand for 
1 hour prior to incubation with NIH3T3 cells, there is no cell- 
associated PDGF receptor phosphorylation. This indicates that 
5 the human type B PDGF receptor protein binds PDGF BB ligand in 
solution and prevents the ligand from activating cell- 
associated PDGF receptors. Thus, polypeptides which contain 
LBR's may be used to block normal PDGF responses. 

The domain containing structures of the present 
10 invention will find use both as diagnostic and therapeutic 
reagents. The receptor polypeptides may be used as affinity 
reagents for detecting or binding ligand, as well as for 
interacting with receptor-like proteins, e.g., affecting 
receptor protein dimerization. The polypeptides will also be 
IS useful as reagents for detecting or purifying other proteins 
which associate with the receptors or fragments thereof. 

The receptor polypeptides will also find use in 
generating other reagents, e.g., antibodies specific for 
binding epitopes peculiar to the modified receptors. In 
20 particular, antibodies raised against newly formed ligand 
binding determining segments may serve as ligands for the 
modified receptors. These techniques may provide for 
separating various functionalities of the receptors, thereby 
isolating each of the different effector functions from others, 
25 in response to PDGF binding. 

The modified receptors of the present invention also 
provide methods for assaying ligands for them. For example, 
soluble ligand binding fragments will be useful as competing 
sites for ligand binding, a useful property in a ligand binding 
30 assay. In particular, the present invention provides an assay 
to screen for PDGF binding inhibition, allowing screening of 
large numbers of compounds. These compounds may be assayed in 
vitro, which allows testing of cytotoxic or membrane disruptive 
. compounds. The present solid phase system allows reproducible, 
35 sensitive, specific, and readily automated assay procedures. 
Polystyrene 96-well plates may be coated with the appropriate 
construct with LBR's to assay for ligand binding activity. 
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Moreover, modifications to the ligand binding domains 
will lead to binding region combinations with different ligand 
binding affinities. Thus, modulation of ligand effected 
response may be easily achieved by inclusion of the appropriate 

5 affinity modified analogue. 

Solid phase assays using these modified receptors may 
also be developed, providing greater sensitivity or improved 
capacity over unmodified binding regions. 

Diagnostic kits comprising these reagents are also 

10 provided. The kit typically comprise a compartmentalized 

enclosure, e.g., a plastic substrate having diagnostic reagents 
of the invention attached thereto. The package will typically 
also include various buffers, labeling reagents, and other 
reagents as appropriate for the diagnostic test to be 

15 performed. Instructions for use of the related reagents and 
interpretation of the results will be provided. 

In particular, the important functional segment of 
the extracellular domain will usually be attached to a plastic 
or other solid phase substrate. The binding regions will 

20 usually be selected for a combination of the affinity and 
ligand binding spectrum of the modified binding segments. 
Appropriate ligands will often be introduced to determine the 
ligand binding, activity and affinity. Different LBR 
combinations will be used, and can be used to test for 

25 differently modified, e.g., labeled, ligands. 

-In addition, the peptides wili.be useful for 
therapeutic administration. The quantities of reagents 
necessary for effective therapy will depend upon many different 
factors, including means of administration, target site, 

30 physiological state of the patient, and other medicants 

administered. Thus, treatment dosages should be titrated to 
optimize safety and efficacy* Typically, dosages used in vitro 
may provide useful guidance in the amounts useful for in situ 
administration of these reagents.. Animal testing of effective 

35 doses for treatment of particular disorders will provide 
further predictive indication of human dosage. Various 
considerations are described, e.g., in Gilman et al. (eds) , 
(1990) Goodman and Gilman 's: The Pharmacological Basis of 
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Therapeutics , 8th ed. , Pergamon Press; and Remington's 
fVrr ,^tica l Sciences . (1985) 7th ed. , Mack Publishing Co- , 
Eastern, Penn. ; each of which is hereby incorporated by 
reference. Methods for administration are discussed therein, 
e.g., for oral, intravenous, intraperitoneal, or intramuscular 
administration, transdermal diffusion, and others. 
Pharmaceutically acceptable carriers will include water, 
saline, buffers, and other compounds described, e.g., in the 
Merck Index, Merck & Co., Rahway, New Jersey. Because of the 
high affinity binding between PDGF and its receptors, low 
dosages of these reagents would be initially expected to be 
effective. XnuSf dosage ranges would ordinarily be expected to 
be in amounts lower than 1 mM concentrations, typically less 
than about 10 /iM concentrations, usually less than about 
100 nM, preferably less than about 10 pM (picomolar) , and most 
preferably less than about 1 fM (femtomolar) , with an 
appropriate carrier. 

The pharmaceutical compositions will be administered 
by parenteral, topical, oral or local administration, such as 
by aerosol or transdermal ly, for prophylactic and/ or 
therapeutic treatment. The pharmaceutical compositions can be 
administered in a variety of unit dosage forms depending upon 
the method of administration. For example, unit dosage forms 
suitable for oral administration include powder, tablets, 
pills, capsules and dragees. 

Preferably, the pharmaceutical compositions are 
administered intravenously. Thus, this invention provides 
compositions for. intravenous administration which comprise a 
solution of the compound dissolved or suspended in an 
acceptable carrier, preferably an aqueous carrier. A variety 
of aqueous carriers may be used, e.g., water, buffered water, 
0.4% saline, and the like. These compositions may be 
sterilized by conventional, well known sterilization 
techniques, or may be sterile filtered. The resulting aqueous 
solutions may be packaged for use as is, or lyophilized, the 
lyophilized preparation being combined with a sterile aqueous 
solution prior to administration. The compositions may contain 
pharmaceutically acceptable auxiliary substances as required to 




WO 92/13867 PCT/US92/O0730 

65 

approximate physiological conditions, such as pH adjusting and 
buffering agents, tonicity adjusting agents, wetting agents and 
the like, for example, sodium acetate, sodium lactate, sodium 
chloride, potassium chloride, calcium chloride, sorbitan 
5 moholaurate, triethanol amine oleate, etc. 

For solid compositions, conventional nontoxic solid 
carriers may be used which include, for example, pharmaceutical 
grades of mannitol, lactose, starch, magnesium stearate, sodium 
saccharin, talcum, cellulose, glucose, sucrose, magnesium 

10 carbonate, and the like. For oral administration, a 

pharmaceutical^ acceptable nontoxic composition is formed by 
incorporating any of the normally employed excipients, such as 
those carriers previously listed, and generally 10-95% of 
active ingredient, preferably about 20% ( see . Remington 1 s . 

15 supra ) • 

For aerosol administration, the compounds are 
preferably supplied in finely divided form along with a 
surfactant and propellant. The surfactant must, of course,, be 
nontoxic, and preferably soluble in the propellant. 
20 Representative of such agents are the esters or partial esters 
of fatty acids containing from 6 to 22 carbon atoms, such as 
caproic, bctanoic, lauric, palmitic, stearic, linoleic, 
linolenic, olesteric and oleic acids with an aliphatic 
polyhydric alcohol or its cyclic anhydride such as, for 
25 example, ethylene glycol, glycerol, erythritol, arabitol, 
mannitol, sorbitol, the hexitol anhydrides derived from 
sorbitol, and the polyoxyethylene and polyoxypropylene 
. derivatives of these esters. Mixed esters, such as mixed or 
natural glycerides may be employed. The surfactant may 
30 constitute 0.l%-20% by weight of the composition, preferably 
0.25-5%. The balance of the composition is ordinarily 
propellant. Liquefied propellants are typically gases at 
ambient conditions, and are condensed under pressure. Among 
suitable liquefied propellants are the lower alkanes containing 
35 up to 5 carbons, such as butane and propane; and preferably 
fluorinated or fluorochlorinated alkanes. Mixtures of the 
above may also be employed • In producing the aerosol, a 
container equipped with a suitable valve is filled with the 
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appropriate propellant, containing the finely divided compounds 
and surfactant. The ingredients are thus maintained at an 
elevated pressure until released by action of the valve. 

The compositions containing the compounds can be 
administered for prophylactic and/or therapeutic treatments, 
in therapeutic applications, compositions are administered to a 
patient already suffering from a disease, as described above, 
in an amount sufficient to cure or at least partially arrest 
the symptoms of the disease and its complications. An amount 
adequate to accomplish this is defined as "therapeutically 
effective dose." Amounts effective for this use will depend on 
the severity of the disease and the weight and general state of 
the patient. 

In prophylactic applications, compositions containing 
the compounds of the invention are administered to a patient 
susceptible to or otherwise at risk of a particular disease. 
Such an amount is defined to be a "prophylactically effective 
dose." In this use, the precise amounts again depend on the 
patient's state of health and weight. 

The invention will better be understood by reference 
to the following illustrative examples. The following examples 
are offered by way of illustration and not by way of 
limitation. 

EXPERIMENTAL 

In general, standard techniques of recombinant DNA 
technology are described in various publications, e.g., 

Sambrook et al. (1989) Macular Cloning: A laboratory Manual , 

Cold Spring Harbor Laboratory; Ausubel et al. (1987) Current 
Protocols in Molecular Biology , vols. 1 and 2 and supplements; 
and Wu and Grossman (eds.) (1987) Methods in Engymology, Vol. 
53 (Recombinant DNA Part D) ; each of which is incorporated 
herein by reference. 

I. Human Extracellular Redion 

Equivalent techniques for construction, expression, 
and determination of the physiological effect of truncation or 
deletion analogues of the soluble extracellular receptor 
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fragments from the human receptor may be performed using the 
nucleic acid r polypeptide, and other reagents provided herein. 

A. Type B Segments 
5 Constructs of type B receptor polypeptides were made 

as follows: 

The 3.9 kb EcoRI-Hind III cDNA fragment of the human 
type B hPDGF-R was subcloned into the EcoRI-Hind III site of 
M13 Mpl8 to produce a vector Mpl8PR. For techniques, see 
10 Maniatis et al. (1982) Molecular Cloning: A Laboratory Manual , 
Cold Spring Harbor, N.Y., which is incorporated herein by 
reference. Verification of subcloning was performed by 
restriction enzyme digestion analysis and dideoxy chain 
termination sequencing, as described by Sanger et al. (1977) 
15 Proc. Nat'l Acad. Sci . USA 74:5463. Oligonucleotide directed 
in vitro mutagenesis was performed according to the method 
described by Kunkel et al. (1987) Methods in Enzvmol . . 154:367. 
The strategy for oligonucleotide directed in vitro deletion 
mutagenesis of MplSPR is outlined in Fig. 1. 
20 In brief, a series of oligonucleotides were designed 

. to create a nested set of soluble type B hPDGF receptor 
extracellular regions by deletion mutagenesis. These domains 
are designated Domain 1 through Domain 5 (D1-D5) , suitable for 
expression in an appropriate eukaryotic expression system. A 
25 description of the mutagenic oligonucleotides aligned with -the 
corresponding -regions of the human PDGF receptor are listed in 
Table 10. The resulting constructs are labeled as indicated in 
Table 13. The antisense strand was used for mutagenesis 
throughout. Mutagenesis of PA1, PA2, PA3, PA4 , and PAS , 
30 utilized MplSPR as the template and mutagenesis of PA6 , PA7, 

PA8, and PA9, utilized MP 18 PA1 as the template. PA1, a 41 bp 
oligomer, introduced a TAG stop codon after Lysine 499 (K 499 ) of 
D5 and removed the transmembrane (TM) as well as entire 
intracellular kinase domain (K) , producing an Mpl8 PA1 (see 
35 Fig. !)♦ PA1 codes for 530 148 precursor proteins. 



WO 92/13867 PCT/US92/OO730 

68 



20 



TABLE 13 

HUMAN TYPE B PDGF-R EXPRESSION CONSTRUCTS 



Soluble Membrane Bound 

pBJPR 



10 pBJPAl 
pBJPA2 
pBJPA3 
pBJPA4 
pBJPAS 

15 pBJPA6 
pBJPA7 
pBJPA8 
pBJPA9 



The human PDGF receptor constructs were subsequently 
25 subcloned into the EcoRI-Hind III site of pBJl a derivation of 
pCDL-SRa296 r as described in Takabe et al. (1988) Molec, Cell 
Biol. 8:466, and co-trans fected with pSV2NEO, as described by 
Southern and Berg (1982) J. Mol . A onl. Gen. . 1: 327 , into 
Chinese hamster ovary cells (CHO) . See Pigs. 2 and 3. 
30 Function of the constructs was . demonstrated, as 

follows: 

A sample of 0.3 3 nM PDGF BB ligand is preincubated 
for 1 hr at 4'C under the following conditions: 

1. a polyclonal antibody to human PDGF (this antibody 
35 recognizes human PDGF AA, PDGF BB and PDGF AB) ; 

2. 18 nM (60 fold molar excess to PDGF BB) human type B 
PDGF receptor; 

3. phosphate buffered saline solution that the receptor 
and antibody are in ; or 

40 4. no additions but the ligand itself. 

In a duplicate set of experiments, 0.33 nM PDGF AA is 
incubated with three of the above preincubation conditions, 
e.g., 2, 3, and 4 above. The human type B PDGF receptor does 
not appreciably recognize PDGF AA but this ligand will still 

45 activate cell-associated human type A PDGF receptor from NIH3T3 
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cells and so is a control for human type B PDGF receptor 
specificity and. PDGF BB-dependent activation versus non- 
specific general cellular effect, e.g., cytotoxicity. 

The preincubated materials were in a final volume of 
5 0.5 ml. They were placed in one well each of a six well tissue 
culture dish containing a confluent layer of serum starved 
(quiescent) NIH3T3 cells which were chilled to 4 f C. The cells 
and incubation mixtures were agitated, e.g., rocked, at 4*C for 
2 h. They were then washed twice with 4*C phosphate buffered 

10 saline. Forty pi of 125 mM Tris(hydroxymethyl) amino methane 
(Tris), pH 6.8, 20% (v/v) glycerol, 2% (w/v) sodium dodecyl 
sulfate (SDS), 2% (v/v) 2-mercaptoethanol, and 0.001% 
bromphenol blue, (known as SDS sample buffer) , was added per 
microtiter well followed by 40 m1 of 100 mM Tris, pH 8.0, 30 mM 

15 sodium pyrosphoshate, 50 mM sodium fluoride, 5 mM. 
ethylenediaminetetraacetic acid (EDTA) , 5 mM 

ethylenebis(oxyethylenenitrilio)tetraacetic acid, 1% (w/v) SDS, 
100 mM dithiothreitol, 2 mM phenylmethylsulf onylf luoride 
(PMSF), and 200 mM sodium vanadate was added to the cells. The 

20 cells were solubilized and 40 Ml additional SDS sample buffer 
was added to the solubilizate. This material was boiled 5 
minutes and loaded onto a single gel sample well of a 7.5% 
sodium dodecyl sulfate polyacrylamide gel. Cellular proteins 
were separated by electrophoresis. . 

25 The separated proteins were transferred to 

nitrocellulose by electrotransfer and the resulting "Western 
blot" was incubated with 3 changes of. 0.5% (w/v) sodium 
chloride, 5 mg/ml bovine serum albumin, 50 mM Tris, pH 7.5, 
(designated blocking buffer) for 20 minutes each at room 

30 temperature. A 1/1000 dilution of PY20 (a commercially 

available monoclonal antibody to phosphotyrosine [ICN]) in 
blocking buffer was incubated with the blot overnight at 4*C. 
The blot was washed 3 times for 20 minutes each at room 
temperature in blocking buffer. The blot was incubated with 4 

35 MCi/40 ml of 125 I-Protein A [Amersham] in blocking buffer for 1 
hour at room temperature and washed 3 times for 20 minutes each 
at room temperature in blocking buffer. The blot was exposed 
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to X-ray film for 48 h with one intensifying screen at -70 -c 
and developed with standard reagents. 

Figure 4 shows the results of the autoradiogram with 
the conditions mentioned above plus the additional condition of 
5 no added ligand (no PDGF) . This added condition defines the 
level of cell-associated receptor activation (e.g., 
phosphorylation) in the absence of any added ligand. Both the 
antibody and the human type B PDGF receptor neutralized the 
activation of cell-associated PDGF receptor by PDGF BB. This 
10 is apparently due to direct binding and sequestration of the 
ligand making it unavailable for PDGF receptor activation. 
pl85 shows the receptor position. 

B. Type A Sequence 

15 Similar manipulations using the mutagenic 

oligonucleotides of Table 12 are used to construct the type A 
constructs listed in Table 15. Note that the type A constructs 
have not actually been produced, but would readily be produced 
by these methods. Similar assays are used to test the function 

20 of the constructs. 



TABLE 15 

SUGGESTED HUMAN TYPE A PDGF— R EXPRESSION CONSTRUCTS 



25 



30 



type A 

Soluble MeTTibrane Bound 

pARSR 



pARSAl 
pARSA2 
pARSA3 
pARSA4 

35 pARSAS 

pARSA6 
pARSA7 
pARSA8 
pARSA9 

40 



45 
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C. PDGF Plate Assay 

Polystyrene microtiter plates (Immulon, Dynatech 



Laboratories) were coated with the extracellular region 
fragment of the type B human PDGF receptor (described above) by 
5 incubating approximately 10-100 ng ot this protein per well in 
100 jxl of 25 mM Tris, 75 mM NaCl, pH 7.75 for 12 to 18 h at 
4'C. The protein was expressed in transfected CHO cells and 
collected in serum-free media (Gibco MEMa) at a concentration 
of. 0.2 -1 jig/ml, with a total protein concentration of 
10 150 - 300 w/ia. 

The human PDGF type B receptor extracellular region 
fragment was concentrated and partially- purified by passing the 
media over wheat genn-agglutinin-sepharose at 4*C (at 48 ml/h) 
in the presence of 1 mM PMSF. After extensive washing/ the 
15 protein was eluted in 0.3 M N-acety 1 -glucosamine , 25 mM Hepes, 
100 mM NaCl, 1 mM PMSF, pH 7.4. This fraction was then applied 
to Sephacryl S-200 HR (Pharmacia) equilibrated in 0.15 M 
ammonium bicarbonate pH 7.9. The fractions containing . receptor 
(3 ^ 10 ng//il) were detected by SDS-PAGE and Western blotting 
20 with a polyclonal rabbit antibody, made by standard methods, 
against a Domain 1 (Dl) segment from the receptor external 
region. These fractions (3-10 ng/pl) were used to coat the 
microtiter wells as described above. The wells were then 
drained, rinsed once with 200 /il each of 0.5% gelatin (Bio-Rad f 
25 EIA grade), 25 mM Hepes, 100 mM NaCl, pH 7.4, and incubated for 
1*2 h at 24 *C with 150 /il of this same solution. The wells 
were drained and rinsed twice with 0.3% gelatin, 25 mM Hepes ; 
100 mM NaCl, pH 7.4 (150 /xl each). 90 Ml of the 0.3% gelatin 
solution was put in each well (wells used to test nonspecific 
30 binding received just 80 nl and then 10 ^1 of 0.01 mg/ml non- 
labeled PDGF in the 0,3% gelatin solution). PDGF BB (Amgen) 
was iodinated at 4*C to 52,000 CPM/ng with di-iodo Bolton- 
Hunter reagent (Amersham) and approximately 40,000 CPM was 
added per well in 10 pi, containing 0.024% BSA, 0.4% gelatin, 
35 20 mM Hepes, 80 mM NaCl, 70 mM acetic acid, pH 7.4. The plate 
was incubated for 2-3 h at 24 'C, after which wells were washed 
three times .with 150 pi each with 0.3% gelatin, 25 mM Hepes, 
100 mM NaCl, pH 7.4. The bound radioactivity remaining was 
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15 



solubilized from the veils in 200 ,1 1% «», 0.3% BSA, and 
counted in a gamma-counter. The nonspecific binding was 

^ _ ^ c a- -Fold excess of unlabeled 
determined in the presence of a 150 fold exce 

PDGF BB (Amgen) and was about 7% of the total bound. I-PDGF 
5 similar assays will be possible using type A receptor 

fragments. However, the type A receptor fragments are more 
sensitive to the presence of other proteins than the type B 
fragments, and appear to require a different well coating 
reagent from the gelatin. Hemoglobin is substituted for 
10 gelatin in the buffers at about the same concentrations, other 
blocking proteins will be useful selected from, e.g., the Sigina 
Chemical Company. Titrations to optimize the protein type and 
concentration will be performed to find proteins which do not 
affect the receptor protein binding. 

The present assays require less than 5 ng/well of 
receptor soluble form, which was expressed in transfected CHO 
cells, and partially purified by affinity and gel 
chromatography. Using iodinated PDGF-BB, the specific binding 
of less than 10 pg of ligand can be detected in an assay volume 
20 of 100 ^g/well. At 4-c. the binding of -I-PDGF BB to 

immobilized receptor is saturable and of high affinity. The Kd 
by Scatchard analysis was about 1 nM with 1.8 x 10 sites per 
well. The nonspecific binding, determined in the presence of a 
100-fold excess of cold PDGF BB, was usually only about 5-10% 
25 of the total binding. The binding was also specific for the 
isoform of the ligand. insofar as excess cold PDGF AA did not 
inhibit ^I-PDGF BB binding. Furthermore, the externa?, region 
of the type B PDGF receptor in solution competes with its 
immobilized form for binding iodinated PDGF BB (IC 50 = 5nM) . 
30 The ^I-PDGF- BB bound after 4 h at 4'C is only slowly 

dissociable in binding buffer (t l/2 > 6 h) , but is completely 
displaced by the addition of a 150-fold excess of unlabeled 

PDGF BB (t < 1 h). 

These studies were made possible by the availability 
35 of growth factor preparations devoid of contamination with 
other growth factors and by the use of a receptor expression 
system in which all of the measured PDGF responses could be 
attributed. to this single transfected receptor cDNA. 
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All publications and patent applications herein are 
incorporated by reference to the same extent as if each 
individual publication or patent application was specifically 
and individually indicated to be incorporated by reference. 
5 The invention now being fully described, it will be apparent to 
one of ordinary skill in the art that many changes and 
modifications can be made thereto without departing from the 
spirit or scope of the appended claims. 
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GGAGGGGGTG ACTGTCCAGA GCCTGGAACT GTGCCCACAC CAGAAGCCAT CAGCAGCAAG 180 

GACACC ATG CGG CTT CCG GGT GCG ATG CCA GCT CTG GCC CTC AAA GGC 228 
Met Arg Leu Pro Gly Ala Met Pro Ala Leu Ala Leu Lys Gly 
1 5 10 

GAG CTG CTG TTG CTG TCT CTC CTG TTA CTT CTG GAA CCA CAG ATC TCT 276 
Glu Leu Leu Leu Leu Sex Leu Leu Leu Leu Leu Glu Pro Gin He Ser 
15 20 25 30 

CAG GCC CTG GTC GTC ACA CCC CCG GGG CCA GAG CTT GTC CTC AAT GTC 324 
Gin Gly Leu Val Val Thr Pro Pro Gly Pro Glu Leu Val Leu Aen Val 
35 40 45 

TCC AGC ACC TTC GTT CTG ACC TGC TCG GGT TCA GCT CCG GTG GTG TGG 372 
Ser Ser Thr Phe Val Leu Thr Cys Ser Gly Ser Ala Pro Val Val Trp 
50 55 60 

GAA CGG ATG TCC CAG GAG CCC CCA CAG GAA ATG GCC AAG GCC CAG GAT 420 

Glu Arg Met Ser Gin Glu Pro Pro Gin Glu Met Ala Lys Ala Gin Asp 

65 70 75 

GGC ACC TTC TCC AGC GTG CTC ACA CTG ACC AAC CTC ACT GGG CTA GAC 46B 
Gly Thr Phe Ser Ser val Leu Thr Leu Thr Aen Leu Thr Gly Leu Asp 
80 85 90 

ACG GGA GAA TAC TTT TGC ACC CAC AAT GAC TCC CGT GGA CTG GAG ACC 51 6' 

Thr Gly Glu Tyr Phe eye Thr His Asn Asp Ser Arg Gly Leu Glu Thr 
95 100 105 110 

GAT GAG CGG AAA CCG CTC TAC ATC TTT GTG CCA GAT CCC ACC GTG GGC 564 
Asp Glu Arg Lye Arg Leu Tyr He Phe Val Pro Asp Pro Thr val Gly 
: 115 120 125 

TTC CTC CCT AAT GAT GCC GAG GAA CTA TTC ATC TTT CTC ACG GAA ATA 612 
Phe Leu Pro Asn Asp Ala Glu Glu Leu Phe He Phe Leu Thr Glu He 
130 135 140 

ACT GAG ATC ACC ATT CCA TGC CGA GTA ACA GAC CCA CAG CTG GTG GTG 660 
Thr Glu He Thr He Pro Cys Arg Val Thr Asp Pro Gin Leu Val Val 
145 150 155 

ACA CTG CAC GAG AAG AAA GGG GAC GTT GCA CTG CCT GTC CCC TAT GAT 708 
Thr Leu His Glu Lys Lys Gly Asp Val Ala Leu Pro Val Pro Tyr Asp 
160 165 170 

CAC CAA CGT GGC TTT TCT GGT ATC TTT GAG GAC AGA AGC TAC ATC TGC 756 
His Gin Arg Gly Phe Ser Gly He Phe Glu Asp Arg Ser Tyr lie Cys 
175 180 185 190 

AAA ACC ACC ATT GGG GAC AGG GAG GTG GAT TCT GAT GCC TAC TAT GTC . 804 

Lys Thr Thr He Gly Asp Arg Glu Val Asp Ser Asp Ala Tyr Tyr Val 
195 200 205 

TAC AGA CTC CAG GTG TCA TCC ATC AAC GTC TCT GTG AAC GCA GTG CAG 852 
Tyr Arg Leu Gin Val Ser Ser He Asn Val ser Val Asn Ala Val Gin 
210 215 220 

ACT GTG GTC CGC CAG GGT GAG AAC ATC ACC CTC ATG TGC ATT GTG ATC 900 
. Thr Val Val Arg Gin Gly Glu Asn lie Thr Leu Met Cys He Val He 
225 230 235 

GGG AAT GAT GTG GTC AAC TTC GAG TGG ACA TAC CCC CGC AAA GAA AGT 948 

Gly Asn Asp Val Val Asn Phe Glu Trp Thr Tyr Pro Arg Lys Glu Ser 
240 245 250 

GGG CGG CTG GTG GAG CCG GTG ACT GAC TTC CTC TTG GAT ATG CCT TAC 996 
Gly Arg Leu Val Glu Pro Val- Thr Asp Phe Leu Leu Asp Met Pro Tyr 
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His lie Arg Ser lie Leu Bis He Pro Ser Ala Glu Leu Glu Asp Ser 

290 2 " 

■ssasassBaaassjass. 
assBaf'sasisEagsasa 

320 325 J 

saaaasasssagasaag 

335 3*0 340 

SBS'a|S2SSgSSSBgS 

''aasaaasagasassas 

370 375 

.-asaaaaaaasaasBas 



ass 39° 395 



TTC CAT GAG GAT GCT GAG GTC CAG CTC TCC TTC CAG CTA CAG ATC AAT 
Phe His Glu Asp Ala Glu Val Gin Leu Ser Phe Gin Leu Gin lie Asn 

400 *° 5 — - — * * AU 

sssaaa a a a'a a-a a a sj 

4^5 420 * 2P 

sssaasaaa a a a a a a 

435 A4w 

ATC TGG TCT 6CC TCC AGA GAC CTC AAA AGG TGT CCA CGT GAG CTG CCG 
Ue Trp Ser Ala Cys Arg Asp Leu Lys Arg Cys Pro Arg Glu Leu Pro 

450 455 

gsaasasssaaaa a a a 

465 470 

Basssasaassasass 

485 490 



480 



CTG CGT CTG CAG CAC GTS GAT CGG CCA CTG TCG GTG CGC TGC ACG CTG 
Leu Arg Leu Gin His Val Asp Arg Pro Leu Ser Val Arg Cys Tfcr Leu 
500 505 



495 



sissasaa'aaaa'saass 

• 5X5 520 

■gaaasaaa a is a' a a a a a 



535 540 

rrc CTC ACC ATC ATC TCC CTT ATC ATC CTC ATC ATG CTT TGG CAG AAG 
§5 ^r lie Ue ier Leu lie lie Leu lie Met Leu Trp Gin Lys 



1092 
1140 
1188 
1236 
1284 
1332 
1380 
1428 
1476 
1524 
1572 
1620 
1668 



1716 



1764 



1812 



1B60 
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560 565 570 

GAC CCC CAT GAG TAC ATC TAC GTG GAC CCC ATG CAG CTG CCC TAT GAC . 1956 

Asp Gly His Glu Tyr lie Tyr Val Asp Pro Met Gin Leu Pro Tyr Asp 

575 580 585 590 



TCC ACG TGG GAG CTG CCG CGG GAC CAG CTT GTG CTG GGA CGC ACC CTC 
Ser Thr Trp Glu Lou Pro Arg Asp Gin Leu Val Leu Gly Arg Thr Leu 
595 600 605 



2004 



GGC TCT GGG GCC TTT GGG CAG GTG GTG GAG GCC ACA GCT CAT GGT CTG 2052 
Gly Ser Gly Ala Phe Gly Gin Val Val Glu Ala Thr Ala His Gly Leu 
610 615 620 

AGC CAT TCT CAG GCC ACG ATG AAA GTG GCC GTC AAG ATG CTT AAA TCC 2100 
Ser His Ser Gin Ala Thr Met Lys Val Ala Val Lys Met Lau Lys Ser 
625 630 635 

ACA GCC CGC AGC AST GAG AAG CAA GCC CTT ATG TCG GAG CTG AAG ATC 2148 
Thr Ala Arg Ser Ser Glu Lys Gin Ala Leu Met Ser Glu Leu Lys lie 
640 645 650 

ATG* AGT CAC CTT GGG CCC CAC CTG AAC GTG GTC AAC CTG TTG GGG GCC 2196 
Met Ser His Leu Gly Pro His Leu Asn Val Val Asn Leu Leu Gly Ala 

655 660 665 670 

TGC ACC AAA GGA GGA CCC ATC TAT ATC ATC ACT GAG TAC TGC CGC TAC 2244 
cys Thr Lys Gly Gly Pro lie Tyr lie lie Thr Glu Tyr cys Arg Tyr 
1 1 675 680 685 

GGA GAC CTG GTG GAC TAC CTG CAC CGC AAC AAA CAC ACC TTC CTG CAG 2292 ' 

Gly Asp Leu Val Asp Tyr Leu His Arg Asn Lys His Thr Phe Leu Gin 
1 690 695 700 

CAC CAC TCC GAC AAG CGC CGC CCG CCC AGC GCG GAG CTC TAC AGC AAT 2340 
His-His Ser Asp Lys Arg Arg Pro Pro Ser Ala Glu Leu Tyr Ser Asn 
705 710 715 

GCT CTG CCC GTT GGG CTC CCC CTG CCC AGC CAT GTG TCC TTG ACC GGG 2388 
Ala Leu Pro Val Gly Lau Pro Leu Pro Ser His Val Ser Leu Thr Gly 
720 725 730 

GAG AGC GAC GGT GGC TAC ATG GAC ATG AGC AAG GAC GAG TCG GTG GAC 243 6 

Glu Ser Asp Gly Gly Tyr Met Asp Met Ser Lys Asp Glu Ser Val Asp 
735 . 740 745 750 

TAT GTG CCC ATG CTG GAC ATG . AAA GGA GAC GTC AAA TAT GCA GAC ATC 2484 
Tyr Val Pro Met Leu Asp Met Lys Gly Asp Val Lys Tyr Ala Asp lie 
755 760 765 

GAG TCC TCC AAC TAC ATG GCC CCT TAC GAT AAC TAC GTT CCC TCT GCC 2532 
Glu Ser Ser Asn Tyr Met Ala Pro Tyr Asp Asn Tyr Val Pro Ser Ala 

770 ' 775 780 

CCT GAG AGG ACC TGC CGA GCA ACT TTG ATC AAC GAG TCT CCA GTG CTA 2580 
Pro Glu Arg Thr Cys Arg Ala Thr Leu lie Asn Glu Ser Pro Val Leu 
785 790 795 

AGC TAC ATG GAC CTC GTG GGC TTC AGC TAC CAG GTG GCC AAT GGC ATG 2628 
Ser Tyr Met Asp Leu Val Gly Phe Ser Tyr Gin val Ala Asn Gly Mat 
800 805 810 

GAG TTT CTG GCC TCC AAG AAC TGC GTC CAC AGA GAC CTG GCG GCT AGG 2676 

Glu Phe Leu Ala Ser Lys Asn Cys Val His Arg Asp Lau Ala Ala Arg 
815 820 825 8.30 

AAC GTG CTC ATC TGT GAA GGC AAG CTG ,GTC AAG ATC TGT GAC TTT GGC 2724 
* Asn Val Leu lie Cys Glu Gly Lys Leu Val Lys He Cys Asp Phe Gly 
835 840 845 



mmaK! 78 Pcr/us«/<Krao 



g.sagsjssssssssssass 



SAC ATC TTC ACC TTG GGT GGC ACC CCT TAC CCA GAG CTG CCC ATG AAC 
■§S lie Phe tS Leu ay Gly Thr Pro Tyr Pro Glu Leu Pro Met Asn 

895 

GAG CAG TTC TAC AAT GCC ATC AAA CGG OCT TAC CGC ATG GCC CAG CCT 
Glu GXn Phe Tyr Asn Ala He Lys Arg Gly Tyr Arg Met Ala Gin Fro 

915 »*° 
GCC CAT GCC -TCC GAC GAG ATC TAT GAG ATC ATG CAG AAG TGC TGG GAA 

His Ala Ser Asp Glu. He Tyr Glu II. Met Gin Lys cys Trp Glu 
930 935 " u 

SAG AAG TTT GAG ATT CGG CCC CCC TTC TCC CAG CTG GTG CTG CTT CTC 
Glu Lys Phe Glu He Arg Pro Pro Phe Ser Gin Leu val Leu Leu Leu 
945 950 333 

GAG AGA CTG TTG GGC GAA GGT TAC AAA AAG AAG TAC CAG CAG GTG GAT 
Glu Arg Leu Leu Gly Glu Gly Tyr Lys Lys Lys Tyr Gin Gin Val Asp 
960 965 97 

GAG GAG TTT CTG AGG AGT GAC CAC CCA GCC ATC CTT CGG TCC CAG GCC 
Glu Glu Phe Leu Arg Ser Asp His Pro Ala lie Leu Arg . Ser Gin Ala 
975 9*° 98 ^ 

CGC TTG CCT GGG TTC CAT GGC CTC CGA TCT CCC CTG GAC ACC AGC TCC 
irg Leu Pro Gly Phe His Gly Leu Arg Ser Pro Leu Asp Thr Ser Ser 
995 1000 AUUS 

GTC CTC TAT ACT GCC GTG CAG CCC AAT GAG GGT GAC AAC GAC TAT ATC 

val Leu Tyr Thrjtla Val Gin Pro **» 5 <? lu Gly Asp Asn ^sp^Tyr lie 



1010 



2B20 



286B 



2916 



2964 



3012 



3060 



. 310B 



3156 



3204 



3252 



3300 



ATC CCC CTG CCT GAC CCC AAA CCT GAG GTT GCT GAC GAG GGC CCA CTG 
lie Pro Leu Pro Asp Pro Lys Pro Glu val Ala Asp Glu Gly Pro Leu 
1025 1030 loss 

GAG GGT TCC CCC AGC CTA GCC AGC TCC ACC CTG AAT GAA GTC AAC ACC 
5lu Gly Ser. Pro Ser Leu Ala ser Ser Thr Leu Asn Glu Val Asn Thr 
1040 10*5 1050 

*CC TCA ACC ATC TCC TGT GAC AGC CCC CTG GAG CCC CAG GAC GAA CCA 
S« s« Thr lie Ser Cys Asp Ser Pro Leu Glu Pro Gin Asp Glu Pro 
j.055 1060 1065 iU,u 

GAG CCA GAG CCC CAG CTT GAG CTC CAG GTG GAG CCG GAG CCG GAG CTG 
Glu Pro Glu Pro Gin Leu Glu Leu Gin Val Glu Pro Glu Pro Glu Leu 
1075 1080 iU63 

GAA CAG TTG CCG GAT TCG GGG TGC CCT GCG CCT CGG GCG GAA GCA GAG 
SlS Gin Leu Pro Asp ser Gly Cys Pro Ala Pro Arg Ala Glu Ala Glu 
1090 1095 ixuu 

GAT AGC TTC CTG TAGGGGGCTG GCCCCTACCC TGCCCTGCCT GAAGCTCCCC 3544 
Asp Ser Phe Leu 
1105 

. CGCTGCCAGC ACCCAGCATC TCCTGGCCTG GCCTGGCCGG GCTTCCTGTC AGCCAGGCTG 3604 
CCCTTATCAG CTGTCCCCTT CTGGAAGCTT T CTG CTC CTG ACGTGTTGTG CCCCAAACCC 



3348 



3396 



3444 



3492 



3664 
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TGCGAAAGTT AGCCTTGATG ACCCAGAATC TAGGATTCTC TCCCTGGCTG ACAGCTGGGG 3844 

AGACCGAATC CCTCCCXGGG AAGATTCTTG GAGTTACTGA GGTGGTAAAT TAACTTTTTT 3904 

CTGTTCXGCC AGCTACCCCT CAAGGAATCA TAGCTCTCTC CTCGCACTTT TATCCACCCA 3964. 

GGAGCTAGGG AAGAGACCCT AGCCTCCCTG GCTGCTGGCT GAGCTAGGGC CTAGCCTTGA 4G24 

GCA6TGTTGC CTCATCCAGA AGAAAGCCAG TCTCCTCCCT ATGATGCCAG TCCCTGCGTT 4084 

CCCTGGCCCG AGCTGGTCTG GGGCCATTAG GCAGCCTAAT TAATGCTGGA GGCTGAGCCA ' ■ 4144 

AGTACAGGAC ACCCCCAGCC TGCAGCCCTT GCCCAGGGCA CTTGGAGCAC ACGCAGCCAT 4204 

AGCAAGTGCC T G TGT CC CTG TCCXTCACGC CCATCAGTCC TGGGGCTTTT TCTTTATCAC 4264 

CCTCAGTCTT AATCCATCCA CCAGAGTCTA GAAGGCCAGA CGGGCCCCGC ATCTGTGATG 4324 

AGAATGTAAA TGTGCCAGTG TGGAGTGGCC ACGTGTGTGT GCCAGATATG GCCCTGGCTC 4384 

TGCATTGGAC CTGCTATGAG GCTTTGGAGG AATCCCTCAC CCTCTCTGGG CCTCAGTTTC 4444 

CCCTTCAAAA AATGAATAAG TCGGACTTAT TAACTCTGAQ TCCCTTGCCA GCACTAACAT 4504 

TCTAGAGTAT CCAGGTGGTT GCACATTTGT CCAGATGAAG CAAGGCCATA TACCCXAAAC 4 564 

TTCCATCCTG GGGGTCAGCT GGGCTCCTGG GAGATTCCAG ATCACACATC ACACTCTGGG 4624 

GACTCAGGAA CCATGCCCCT TCCCCAGGCC CCCAGCAAGT CTCAAGAACA CAGCTGCACA ■ 4684 

GCCCTTGACT TAGAGTGACA GCCGGTGTCC TGGAAAGCCC CCAGCAGCTG CCCCAGGGAC 4744 

ATGGGAAGAC CACGGGACCT CTTTCACTAC CCACGATGAC CTCCGGGGGT ATCCTGGGCA 4804 

AAAGGGACAA AGAGGGCAAA TGAGATCACC TCCTGCAGCC CACCACTCCA GCACCTGTGC . 4864 

CGACGTCTGC GTCGAAGACA GAATGGACAG TGAGGAGAGT TATGTCTTGT AAAAGACAAG 4924 

AAGCTTCAGA TGGGTACCCC AAGAAGGATG TGAGACGTGG GCGCTTTGGA GGTTTGCCCC 4984 

TCACCCACCA GCTGCCCCAT CCCTGAGGCA GCGCTCCATG GGGGTATGGT TTTGTCACTG 5044 

CCCAGACCTA GCAGTGACAT CTCATTGTCC CCAGCCCAGT GGGCATTGGA GGTGCCAGGG 5104 

GAGTCAGGGT TGTAGCCAAG ACGCCCCCGC ACGGGGAGGG TTGGGAAGGG GGTGCAGGAA 5164 

. GCTCAACCCC TCTGGGCACC AACCCTGCAT TGCAGGTTGG CACCTTACTT CCCTGGGATC 5224 

CCAGAGTTGG TCCAAGGAGG GAGAGTGGGT TCTCAATACG GTACCAAAGA TATAATCACC 5284 

TAGGTTTACA AATATTTTTA GGACTCACGT TAACTCACAT TTATACAGCA GAAATGCTAT 5344 

TTTGTATGCT GTTAAfcTTTT TCTATCTGTG TACTTTTTTT TAAGGGAAAG ATTTTAATAT 5404 

TAAACCTGGT GCTTCTCACT CAC ' 5427 



(2) INFORMATION FOR SEQ ID HO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 1106 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 
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20 25 

Lw val Val Thr Pro Pro Gly Pro Glu Leu Val Leu Asn Val Scr ser 



35 40 
Thr Phe Val Leu Thr cy B ser Gly Ser Ala Pro Val Val Trp Glu Arg 

50 55 
Met ser Gin Glu Pro Pro Gin Glu Met Ala Lys Ala Gin Asp Gly Thr 

65 70 7S 

Phe Ser Ser Val Leu Thr Leu Thr Asn Leu Thr Gly Leu Asp Thr Gly 
85 

Glu Tyr Phe cys Thr His Asn Asp Ser Arg Gly Leu Glu Thr Asp Glu 

100 i05 
Arg Lys Arj Leu Tyr lie Phe Val Pro A.p Pro Thr Val Gly Phe Leu 

Pro Asn Asp Ala Glu Glu Leu Phe lie Phe Leu Thr Glu lie Thr Glu 

130 135 
II. Thr lie Pro cys Arg Val Thr Asp Pro Gin Leu Val Val Thr Leu 

His Glu Lys Lys Gly Asp Val Ala Leu Pro Val Pro Tyr Asp His Gin 

Ar, Gly Phe Ser Gly He Phe Glu Asp Arg Ser Tyr lie Cys Lys Thr 

Thr lie GJ| Asp Arg Glu Val Asp Ser Asp Ala Tyr Tyr Val Tyr Arg 

I*u Gin val Ser Ser He Asn Val Ser Val Asn Ala Val Gin Thr val 

210 215 
Val Arg Gin Gly Glu Asn He Thr Leu Met cys lie Val He Gly Asn 

225 230 

Asp val val Asn Phe Glu Trp Thr Tyr Pro Arg Lys Glu Ser Gly Arg 

!*u Val Glu Pro Val Thr Asp Phe Leu Leu Asp Met Pro Tyr His lie 

260 265 
Arg ser He Leu His He Pro ser Ala Glu Leu Glu Asp Ser Gly Thr 

275 ^80 *° 

Tyr Thr cys Asn Val Thr Glu Ser Val Asn Asp His Gin Asp Glu Lys 

" Ma He Asn He Thr Val Val Glu ser Gly Tyr Val Arg Leu Leu Gly 
305 31° 3 " 

Glu Val Gly Thr Leu Gin Phe. Ala Glu Leu His Arg Ser Arg Thr Leu 

325 330 

Gin val val Phe Glu Ala Tyr Pro Pro Pro Thr Val Leu Trp Pne Lys 

340 3 « 5 
Asp Asn Arg Thr' Leu Gly Asp ser ser Ala Gly Glu lie Ala i*u Ser 

Thr Arg Asn val ser Glu Thr Arg Tyr val. ser Glu Leu Thr Leu val 

370 375 
Arg val Lys Val Ala Glu Ala Gly His Tyr Thr Met Arg Ala Phe His 
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val Arg Val Leu Glu Leu ser Glu ser His Pro Asp ser Gly Glu Gin 
420 425 430 

Thr Val Arg cys Arg Gly Arg Gly Met Pro Gin Pro Asn He He Trp 
435 440 445 

Ser Ala cys Arg Asp Leu Lys Arg cys Pro Arg Glu Leu Pro Pro Thr 
450 455 460 

Leu Leu Gly Asn Ser Ser Glu Glu Glu Ser Gin Leu Glu Thr Asn Val 
465 470 475 480 

Thr Tyr Trp Glu Glu Glu Gin Glu Phe Glu Val Val Ser Thr Leu Arg 
485 . . 490 495 

Leu Gin His Val Asp Arg Pro Leu Ser Val Arg Cys Thr Leu Arg Asn 
500 505 510 

Ala Val Gly Gin Asp Thr Gin Glu Val lie Val Val Pro His Ser Leu 
515 520 525 

Pro Phe Lys Val Val Val He Ser Ala He Leu Ala Leu Val Val Leu 
530 535 540 

Thr He He Ser Leu He He Leu He Met Leu Trp Glh Lys Lys Pro 
545 550 555 560 ' 

Arg Tyr Glu He Arg Trp Lys Val He Glu Ser Val Ser Ser Asp Gly 
565 570 575 

His Glu Tyr He Tyr Val Asp Pro Met Gin Leu Pro Tyr Asp Ser Thr 
580 585 590 

Trp Glu Leu Pro Arg Asp Gin Leu Val Leu Gly Arg Thr Leu Gly Ser 
595 600 605 

Gly Ala Phe Gly Gin Val Val Glu Ala Thr Ala His Gly Leu Ser His 
610 615 620 

Ser Gin Ala Thr Met Lys Val Ala Val Lys Met Leu Lys Ser Thr Ala 
625 630 • 635 640 

Arg Ser Ser Glu Lys Gin Ala Leu Met Ser Glu Leu Lys He Met Ser 
645 650 655 

His Leu Gly Pro His Leu Asn Val Val Asn Leu Leu Gly Ala Cys Thr 
660 665 670 

Lys Gly Gly Pro He Tyr He He Thr Glu Tyr Cys Arg Tyr Gly Asp 
675 680 685 

. Leu Val Asp Tyr Leu His Arg Asn Lys His Thr Phe Leu Gin His His 
690 695 700 

. Ser Asp Lys Arg Arg Pro Pro Ser Ala Glu Leu Tyr Ser Asn Ala Leu 
705 710 715 720 

Pro Val Gly Leu Pro Leu Pro Ser His val Ser Leu Thr Gly Glu Ser 
725 730 735 

Asp Gly Gly Tyr Met Asp Met Ser Lys Asp Glu Ser Val Asp Tyr Val 
740 .745 750 

Pro Met Leu Asp Met Lys Gly Asp Val Lys Tyr Ala Asp He Glu Ser 
755 760 765 

Ser Asn Tyr Met Ala Pro Tyr Asp Asn Tyr Val Pro Ser Ala Pro Glu 
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Het Asp Leu val Gly Phe Ser Tyr Gin Val Ala Asn Gly Met Glu phe 
r 805 81° 815 

Leu Ala Ser Lys Asn cys Val His Arg Asp Leu Ala Ala Arg Asn Val 
820 825 83° 

Leu lie cys Glu Gly Lys Leu Val Lys He Cys Asp Phe Gly Leu Ala 
835 8 40 8*5 

Arg Asp lie Met Arg Asp Ser Asn Tyr He Ser Lys Gly Ser Thr Phe 
850 855 860 

Leu Pro Leu Lys Trp Met Ala Pro Glu Ser He Phe - Asn Ser Leu Tyr 
865 870 875 880 

Thr Thr Leu Ser Asp Val Trp Ser Phe Gly He Leu Leu Trp Glu He 
885 890 895 

Phe Thr Leu Gly Gly Thr Pro Tyr Pro Glu Leu Pro Met Asn Glu Gin 
900 905 910 

Phe Tyr Asn Ala He Lys Arg Gly Tyr Arg Met Ala Gin Pro Ala His 
915 920 925 

Ala Ser Asp Glu He Tyr Glu He Met Gin Lys Cys Trp Glu Glu Lys 
930 935 940 

Phe Glu He Arg Pro Pro Phe Ser Gin Leu Val Leu Leu Leu Glu Arg 
945 950 955 960 

Leu Leu Gly Glu Gly Tyr Lys Lys Lys Tyr Gin Gin Val Asp Glu Glu 
965 970 975 

Phe Leu Arg Ser Asp His Pro Ala He Leu Arg Ser Gin Ala Arg Leu 
980 985 990 

Pro Gly Phe His Gly Leu Arg Ser Pro Leu Asp Thr Ser Ser Val Leu 
995 1000 1005 

Tyr Thr Ala Val Gin Pro Asn Glu Gly Asp Asn Asp Tyr He He Pro 
1010 1015 1020 

Leu Pro Asp Pro Lys Pro Glu Val Ala Asp Glu Gly Pro Leu Glu Gly 
XC25 1030 1035 1040 

Ser Pro Ser Leu Ala Ser Ser Thr Leu Asn Glu Val Asn Thr Ser Ser 
1045 1050 1055 

Thr lie Ser Cys Asp Ser Pro Leu Glu Pro Gin Asp Glu Pro Glu Pro 
. 1060 1065 1070 

Glu Pro Gin Leu Giu Leu Gin val Glu Pro Glu Pro Glu Leu Glu Gin 
1075 1080 1085 

Leu Pro Asp Ser Gly Cys Pro Ala Pro Arg Ala Glu Ala Glu Asp Ser 
1090 _ 1095 1100 

Phe Leu 
1105 

(2) INFORMATION FOR SEQ- ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(iv) anti-sense: no 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM*: Homo Sapiens 

(B) STRAIN: lambda griO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 129.. 3395 

0 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TTGGAGCTAC AGGGAGAGAA ACAGAGGAGG AGACTGCAAG AGATCATTGG AGGCCGTGGG 60 

CACCCTCTTT ACTCCATGTG TGGGACATTC ATTGCGGAAT AACATCGGAG GAGAAGTTTC 120 

CCAGAGCT ATG GGG ACT TCC CAT CCG GCG TTC CTG GTC TTA GCC TGT CTT 170 
Met Gly Thr Ser His Pro Ala Phe Leu Val Leu Gly Cys Leu 
1 5 io 

CTC ACA GGG CTG AGC CTA ATC CTC TGC' CAG CTT TCA TTA CCC TCT ATC 218 
Uu Thr Gly Leu Ser Leu Ila Leu cys Gin Leu Ser Leu Pro Ser lie 
15 20 25 30 

CTT CCA AAT GAA AAT GAA AAG GTT GTG CAG CTG AAT TCA TCC TTT TCT 266 
Leu Pro Asn Glu Asn Glu Lys Val Val Gin Leu Asn Ser Ser Phe Ser 
35 40 45 

CTG AGA TGC TTT GGG GAG AGT GAA GTG AGC TGG CAG TAC CCC ATG TCT 314 
Leu Arg Cys Phe Gly Glu Ser Glu Val Ser Trp Gin Tyr Pro Met Ser 
5° 55 60 

GAA GAA GAG AGC TCC GAT GTG GAA ATC AGA AAT GAA GAA AAC AAC AGC 3 62 

Glu Glu Glu Ser Ser Asp Val Glu lie Arg Asn Glu Glu Asn Asn Ser 
65 70 75 

GGC CTT TTT GTG ACG GTC TTG GAA GTG AGC AGT GCC TCG GCG CCC CAC 410 
Gly Leu Phe Val Thr Val Leu Glu Val Ser Ser Ala Ser Ala Ala His 
80- 85 90 

ACA GGG TTG TAC ACT TGC TAT TAC AAC CAC ACT CAG ACA GAA GAG AAT 458 
Thr Gly. Leu Tyr Thr Cys Tyr Tyr Asn His Thr Gin Thr Glu Glu Asn 
95 100 105 110 

GAG CTT GAA GGC AGG CAC ATT TAC ATC TAT GTG CCA GAC CCA GAT GTA 506 

Glu Leu Glu Gly Arg His He Tyr lie Tyr Val Pro Asp Pro Asp Val 
"5 120 125 

GCC TTT GTA CCT CTA GGA ATG ACG GAT TAT TTA GTC ATC GTG GAG GAT 554 
Ala Phe Val Pro Leu Gly Met Thr Asp Tyr Leu Val He val Glu Asp 
130 135 140 

GAT GAT TCT GCC ATT ATA CCT TGT CGC ACA ACT GAT CCC GAG ACT CCT 602 
Asp Asp Ser Ala He He Pro Cys Arg Thr Thr Asp Pro Glu Thr Pro 

145 iso r 155 

H 7 ^ ££ C TT * C ^ C AAC AGT GAG 566 GTG GTA CCT GCC TCC TAC GAC AGC 650 
Val Thr Leu Has Asn Ser Glu Gly Val Val Pro Ala Ser Tyr Asp Ser 
160 165 170 

AGA CAG GGC TTT AAT GGG ACC TTC ACT GTA GGG CCC TAT ATC TGT GAG 698 
Arg Gin Gly Phe Asn Gly Thr Phe Thr Val Gly Pro Tyr He Cys Glu 
175 ISO 185 190 

?? C £5° S T ? GGA AAG AA G CAG ACC ATC CCA TTT AAT GTT TAT 74 6 

Ala Thr Val Lys Gly Lys Lys Phe Gin Thr He Pro Phe Asn Val Tyr 

195 200 205 



! 
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842 



305 



890 



938 



isassBsasffisasgsas 

225 230 

AAC AAT GAG" GTG GTT GAC CTT CAA TGG ACT TAC CCT GGA GAA GTG AAA 
Asn Asn Glu Val Val Asp Leu Gin Trp Thr Tyr Pro Gly Glu val Lys 
240 245 250 

GGC AAA GGC ATC ACA ATG CTG.GAA GAA ATC AAA GTC CCA TCC ATC AAA 
Gly Lys Gly lie Thr-Met: Leu Glu Glu lie Lys Val Pro Ser lie Lys 
255 260 265 . 

TO GTG TAC ACT TTG ACG GTC CCC GAC GCC ACG GTG AAA GAC ACT GGA 986 
25 35 t£ ttr ihr Val Pro Glu Ala Thr Val Lys Asp Ser Gly 
275 280 285 

evr TAC GAA TGT GCT GCC CGC CAG GCT ACC AGG GAG GTC AAA GAA ATG 1034 
Sp £r §& ^ Sa irg Gin Ala Thr Arg Glu Val Lys Glu Met 
290 295 ** uu 

*ar xiA ere ACT ATT TCT GTC CAT GAG AAA GGT TTC ATT GAA ATC AAA 1082 
K i£ §5 SJ 12 $11 Ss Glu Lys Gly Ph. lie Glu He Lys 



CCC ACC TTC AGC CAG TTG GAA GCT GTC AAC CTG CAT GAA GTC AAA CAT 1130 
Pro Thr Phe Ser Gin Leu Glu Ala Val . Asn Leu His Glu Val Lys Hxs 
320 325 330 

TTT GTT GTA GAG GTG CGG GCC TAC CCA CCT CCC AGG ATA TCC TGG CTG . 1178 
Phe Val Val Glu Val Arg Ala Tyr Pro Pro Prp Arg He Ser Trp Leu 

335 340 345 350 

AAA AAC AAT CTG ACT CTG ATT GAA AAT CTC ACT GAG ATC ACC ACT GAT 1226 
Lys Asn Asn Leu Thr Leu lie Glu Asn Leu Thr Glu He Thr Thr Asp 
1 355 360 365 

GTG GAA AAG ATT CAG GAA ATA AGG TAT CGA AGC AAA TTA AAG CTG ATC 1274 

Val Glu Lys lie Gin Glu He Arg Tyr Arg Ser Lys Leu Lys Leu He 
370 375 3SV 



1322 



CGT GCT AAG GAA GAA GAC AGT GGC CAT TAT ACT ATT GTA GCT CAA AAT 
Arg Ala Lys Glu Glu Asp Ser Gly His Tyr Thr He val Ala Gin Asn 
385 390 395 

GAA GAT GCT GTG AAG AGC TAT ACT TTT GAA CTG TTA ACT CAA GTT CCT 1370 
Glu Asp Ala Val Lys Ser Tyr Thr Phe Glu Leu Leu Thr Gin Val Pro 
400 405 410 

TCA TCC ATT CTG GAC TTG GTC GAT GAT CAC CAT GGC TCA ACT GGG GGA 1413 
ler IS. lie Lei Asp leu Val Asp Asp His His Gly Ser Thr Gly Gly 
.415 420 425 430 

CAG ACG GTG AGG TGC ACA GCT GAA GGC ACG CCG CTT CCT GAT ATT GAG 1466 

Gin ^r Val Arg cys Thr Ala Glu Gly Thr Pro Leu Pro Asp He Glu 

435 440 440 

TGG ATG ATA TGC AAA GAT ATT AAG AAA TGT AAT AAT GAA ACT TCC TGG 1514 
Trp Het lie Cys Lys Asp He Lys. Lys Cys Asn Asn Glu Thr Ser Trp 
450 455 45U 

ACT ATT TTG GCC AAC AAT GTC TCA AAC ATC ATC ACG GAG ATC CAC TCC 1562 
Thr He Leu Ala Asn Asn Val Ser Asn He He Thr Glu He His Ser 
465 470 475 

CGA GAC AGG AGT ACC GTG GAG GGC CGT GTG ACT TTC GCC AAA GTG GAG 1610 

Arg Asp Arg Ser Thr Val Glu Gly Arg Val Thr Phe Ala Lys Val Glu 

. 480 485 490 
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AAC CGA GAG CTG AAG CTG GTG GCT CCC ACC CTG CGT TCT GAA CTC ACG 1706 
Aan Arg Glu Leu Lys Leu val Ala Pro Thr Leu Arg ser Glu Leu Thr 
515 520 525 

GTG GCT GCT GCA CTC CTG GTG CTG TTG GTG ATT GTG ATC ATC TCA CTT 1754 
Val Ala Ala Ala Val Leu Val Leu Leu Val lie Val He He Ser Leu 
. 530 535 540 

ATT GTC CTG GTT GTC ATT TGG AAA CAG AAA CCG AGG TAT GAA ATT CGC 1802 
lie Val Xeu Val Val He Trp Lys Gin Lys Pro Arg Tyr Glu He Arg 
545 550 555 

TGG AGG GTC ATT GAA TCA ATC AGC CCA GAT GGA CAT GAA TAT ATT TAT 1850 

Trp Arg Val He Glu Ser He Ser Pro Asp Gly His Glu Tyr He Tyr 
560 565 570 

GTG GAC CCG ATG CAG CTG CCT TAT GAC TCA AGA TGG GAG. TTT CCA AGA 1898 

Val Asp Pro Met Gin Leu Pro Tyr Asp Ser Arg Trp Glu Phe Pro Arg 
575 580 585 . 590 

GAT GGA CTA GTG CTT GGT CGG GTC TTG GGG TCT GGA GCG TTT GGG AAG 1946 
Asp Gly Leu Val Leu Gly Arg Val Leu Gly Ser Gly Ala Phe Gly Lys 
595 600 605 

GTC GTT GAA GGA ACA GCC TAT GGA TTA AGC CGG TCC CAA CCT GTC ATG 1994 

Val Val Glu Gly Tnr Ala Tyr Gly Leu Ser Arg Ser Gin Pro Val Met 

610 615 620 

AAA GTT GCA GTG AAG ATG CTA AAA CCC ACG GCC AGA TCC AGT GAA AAA 2042 
Lys Val Ala Val Lys Her Leu Lys Pro Thr Ala Arg Ser Ser Glu Lys 
625 630 635. 

CAA GCT CTC ATG TCT GAA CTG AAG ATA ATG ACT CAC CTG GGG CCA CAT 2090 

Gin Ala Leu Met Ser Glu Leu Lys He Met Thr His Leu Gly Pro His 
640 645 650 

TTG AAC ATT GTA AAC TTG CTG GGA GCC TGC ACC AAG TCA GGC CCC ATT 2138 
Leu Asn He Val Asn Leu Leu Gly Ala Cys Thr Lys Ser Gly Pro He 
655 660 665 670 

TAC ATC ATC ACA GAG TAT TGC TTC TAT GGA GAT TTG GTC AAC TAT TTG 2186 
Tyr He He Thr Glu Tyr Cys Phe Tyr Gly Asp Leu Val Ash Tyr Leu 
675 680 685 

CAT AAG AAT AGG GAT AGC TTC CTG AGC CAC CAC CCA GAG AAG CCA AAG 2234 
His Lys Asn Arg Asp Ser Phe Leu Ser His His Pro Glu Lys Pro Lys 
690 695 700 

AAA GAG CTG GAT ATC TTT GGA TTG AAC CCT GCT GAT GAA AGC ACA CGG 2282 
Lys Glu Leu Asp He Phe Gly Leu Asn Pro Ala Asp Glu Ser Thr Arg 
705 710 715 

AGC TAT GTT ATT TTA TCT TTT GAA AAC AAT GGT GAC TAC ATG GAC ATG 233 0 

Ser Tyr Val He Leu Ser Phe Glu Asn Asn Gly Asp Tyr Met Asp Met 
720 725 730 

AAG CAG GCT GAT ACT ACA CAG TAT GTC CCC ATG CTA GAA AC J AAA GAG 2378 

Lys Gin Ala Asp Thr Thr Gin Tyr val Pro Met Leu Glu Arg Lys Glu 
735 740 745 750 

GTT TCT AAA. TAT TCC GAC ATC CAG AGA TCA CTC TAT GAT CGT CCA GCC 2426 
Val Ser Lys Tyr Ser Asp He Gin Arg Ser Leu Tyr Asp Arg Pro Ala 
755 760 765 

TCA TAT AAG AAG AAA TCT ATG TTA GAC TCA GAA GTC AAA AAC CTC CTT 2474 
Ser Tyr Lys Lys Lys Ser Met Leu Asp Ser Glu Val Lys Asn Leu Leu 
770 775 780 



TCA GAT GAT AAC TCA GAA GGC CTT ACT TTA TTG GAT TTG TTG AGC TTC 



2522 



! 
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Tfar Tyr Gin Val Ala Arg Gly Met Glu Phe Leu Ala Ser Lys Asn cys 

800 805 810 

GTC CAC CGT GAT CTG GCT GCT CGC AAC GTT CTC CTG GCA CAA GGA AAA 
Val His Arg Asp Leu Ala Ala Arg Asn Val Lau Lou Ala Gin Gly Lys 
815 820 825 

ATT GTG AAG ATC TGT GAC TTT GCC CTG GCC AGA GAC ATC ATG CAT GAT 
lie Val Lys lie cys Asp Phe Gly Leu Ala Arg Asp lie Ket Hxs Asp 
835 840 

TCG AAC TAT GTG TCG AAA GGC ACT ACC TTT CTG CCC GTG AAG TCG .ATG 

sir Asn Tyr Val Ser Lys Gly Ser Tfar Pfaa Leu Pro Val Lys Trp Met 
850 855 860 

GCT CCT GAG AGC ATC TTT GAC AAC CTC TAC ACC ACA CTG ACT GAT GTC 
Ala Pro Glu Ser He Phe Asp Asn Leu Tyr Thr Thr Leu Ser Asp Val 

865 870 875 

TGG TCT TAT GGC ATT CTG CTC TGG GAG ATC TTT TCC CTT GGT GGC ACC 

Sp ser Tyr Gly lie Leu Leu Trp Glu He Phe Ser Leu Gly Gly Thr 
K 8B0 885 890 

CCT TAC CCC GGC ATG ATG GTG GAT TCT ACT TTC TAC AAT AAG ATC AAG 
Pro Tyr Pro Gly Met Met Val Asp Ser Thr Phe Tyr Asn Lys He Lys 
895 900 905 91° 

AGT GGG TAC CCG ATG GCC AAG CCT GAC CAC GCT ACC AGT GAA GTC TAC 
Ser Gly Tyr Arg Met Ala Lys Pro Asp His Ala Thr Ser Glu. Val Tyr 
915 920 925 

GAG ATC ATG GTG AAA TGC TGG AAC AGT GAG CCG GAG AAG AGA CCC TCC 
Glu He Ket Val Lys Cys Trp Asn Ser Glu Pro Glu Lys Arg Pro ser 
930 935 940 ' 

TTT TAC CAC CTG AGT GAG ATT GTG GAG AAT CTG CTG CCT GGA CAA TAT 
Phe Tyr His Leu Ser Glu He Val Glu Asn Leu Leu Pro Gly Gin Tyr 
945 950 955 

AAA AAG AGT TAT GAA AAA ATT CAC CTG GAC TTC CTG AAG AGT GAC CAT 
Lys Lys ser Tyr Glu Lys He His Leu Asp Phe Leu Lys Ser Asp his 
960 965 970 

CCT GCT GTG GCA CGC ATG CGT GTG GAC TCA GAC AAT GCA TAC ATT GGT 
pro Ala Val Ala Arg Met Arg val Asp Ser Asp Asn Ala Tyr He Gly 
975 980 985 

. GTC ACC TAC AAA AAC GAG GAA GAC AAG CTG AAG GAC TGG GAG GGT GGT 
Val Thr Tyr Lys Asn, Glu Glu Asp Lys Leu Lys Asp Trp Glu Gly Gly 
995 1000 1005 

CTG GAT GAG CAG AGA CTG AGC GCT GAC AGT GGC TAC ATC ATT CCT CTG 
Leu Asp Glu Gin Arg Leu Ser Ala Asp Ser Gly Tyr lie He Pro Leu 
1010 1015 I" 20 

CCT GAC ATT GAC CCT GTC CCT GAG GAG GAG GAC CTG GGC AAG AGG AAC 
Pro Asp He Asp Pro Val. Pro Glu Glu Glu Asp Leu Gly Lys Arg Asn 
1025 1030 1035 

AGA CAC AGC" TCG CAG ACC TCT GAA GAG AGT GCC ATT GAG ACG GGT TCC 
Arg His Ser Ser Gin Thr Ser Glu Glu Ser Ala He Glu Thr Gly Ser 
1040 1045 1050 

AGC AGT TCC ACC TTC ATC AAG AGA GAG GAC GAG ACC ATT GAA GAC ATC 
Ser Ser Ser Thr. Phe He Lys Arg Glu Asp Glu Thr He Glu Asp lie 
1055 1060 1065 1070 

GAC ATG ATG GAC GAC ATC GGC ATA GAC TCT TCA GAC CTG GTG GAA GAC 
Asp Met Met Asp Asp He Gly He Asp Ser Ser Asp Leu Val Glu Asp 



2618 



2666 



2714 



2762 



2810 



2858 



2906 



2954 



3002 



3050 



3098 



3146 



3194 



3242 



3290 



3338 



3386 



- 87 
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Ser Phe Leu 

ACCTCTGGAT CCCCTTCAGA AAACCACTTT ATTGCAATGC GGAGGTTGAG AGGAGGACTT 3495 

GGTTGATGTT TAAAGAGAAG TTCCCAGCCA AGGGCCTCGG GGAGCCTTTC TAAATATGAA 3555 

TGAATGGGAT ATTTTGAAAT GAACTTTGTC AGTGTTGCCT CTTGCAATGC CTCAGTAGCA 3615 

TCTCACTGGT CTGTGAAGTT TGGAGATAGA TGGATAAGGG AAXAATAGGC CACAGAAGGT 3675 

GAACTTTCTG CTTCAAGGAC ATTGGTGAGA GTCCAACAGA CACAATTTAT ACTGCGACAG 3735 

AACTTCAGCA TTCTAATTAT GTAAATAACT CTAACCACGG CTGTGTTTAG ATTGTATTAA 3795 

CTATCTTCTT TGGACTTCTG AAGAGACCAC TCAATCCATC CATGTACTTC CCTCTTGAAA 3855 

CCTGATGTCA GCTGCTGTTG AACTTTTTAA AGAAGTGCAT GAAAAACCAT TTTTGACCTT 3915 

AAAAGGTACT GGXACTATAG CATTTTGCTA TCTTTTTTAG TGTTAAAGAG ATAAAGAATA 3975 

ATAATTAACC AACCTTGTTT AATAGATTTG GGTCATTTAG AACCCXGACA ACTCATTTTC 4035 

ATATTGTAAT CTATGTTTAT. AATACTACTA CTGTTATCAG TAATGCTAAA TGTGTAATAA 4095 

TGTAA 4100 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1089 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Thr Ser His Pro Ala Phe Leu Val Leu Gly Cys Leu Leu Thr 
1 5 10 15 

Gly Leu Ser Leu lie Leu Cys Gin Leu Ser Leu Pro Ser lie Leu Pro 
20 25 30 

Asn Glu Asn Glu Lys Val Val Gin Leu Asn Ser Ser Phe Ser Leu Arg 
35 40 45 

Cys Phe Gly Glu Scr Glu Val Ser Trp Gin Tyr Pro Met Ser Glu Glu 
50 55 60 

Glu Ser Ser Asp Val Glu lie Arg Asn Glu Glu Asn Asn Ser Gly Leu 
65 70 75 80 

Phe Val Thr Val Leu Glu Val Ser Ser Ala Ser Ala Ala His Thr Gly 
85 90 95 

Leu Tyr Thr Cys Tyr Tyr Asn His Thr Gin Thr Glu Glu Asn Glu Leu 
100 105 110 

Glu Gly Arg His lie Tyr He Tyr Val Pro Asp Pro Asp Val Ala Phe 
115 120. 125 

Val Pro Leu Gly Met Thr Asp Tyr Leu Val He Val Glu Asp Asp Asp 
130 135 140 

Ser Ala lie He Pro Cys Arg Thr Thr Asp Pro Glu Thr Pro Val Thr 
145 150 • 155 160 
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ISO. 185 190 

Val Lys Gly Lys Lys Phe Gin Thr lie Pro Phe Asn Val Tyr Ala Leu 
195 200 205 

Lys Ala Thr Ser Glu Leu Asp Leu Glu Met Glu Ala Leu Lys Thr Val 
210 215 220 

Tyr Lys Ser Gly Glu Thr He Val Val Thr eye Ala Val Phe Asn Asn 
225 230 235 240 

Glu Val Val Asp Leu Gin Trp Thr Tyr Pro Gly Glu Val Lys Gly Lys 
245 250 255 

Gly lie Thr Met Leu Glu Glu He Lys Val Pro. Ser lie Lys Leu Val 
260 265 270 

Tyr Thr Leu Thr Val Pro Glu Ala Thr Val Lys Asp Ser Gly Asp Tyr 

275 280 285 

Glu cys Ala Ala Arg Gin Ala Thr Arg Glu Val Lys Glu Met Lys Lys 
290 295 300 

Val Thr He Ser Val His Glu Lys Gly Phe lie Glu He Lys Pro Thr 
305 310 315 320 

Phe Ser Gin Leu Glu Ala Val Asn Leu His Glu Val Lys His Phe Val 
325 330 335 

Val Glu "Val Arg Ala Tyr Pro Pro Pro Arg He Ser Trp Leu Lys Asn 
340 345 350 

Asn Leu Thr Leu He Glu Asn Leu Thr Glu He Thr Thr Asp Val Glu 
355 360 365 

Lys lie Gin Glu He Arg Tyr Arg Ser Lys Leu Lys Leu He Arg Ala 
370 375 380 

Lys Glu Glu Asp Ser Gly His Tyr Thr He Val Ala Gin Asn Glu Asp 
385 390 395 400 

Ala Val Lys Ser Tyr Thr Phe Glu Leu Leu Thr Gin Val Pro Ser Ser 
405 410 415 

He Leu Asp. Leu Val Asp Asp His His Gly Ser Thr Gly Gly Gin Thr 
420 425 430 

Val Arg Cys Thr Ala Glu Gly Thr Pro Leu Pro Asp He Glu Trp Met 
435 440 445 

He Cys Lys Asp He Lys Lys Cys Asn Asn Glu Thr Ser Trp Thr He 
450 455 460 

Leu Ala Asn Asn Val Ser Asn He He Thr Glu He His Ser Arg Asp 
465 470 475 480 

Arg Ser Thr Val Glu Gly Arg Val Thr Phe Ala Lys Val Glu Glu Thr . 

485 490 495 

He Ala Val Arg cys Leu Ala Lys Asn Leu Leu Gly Ala Glu Asn Arg 
500 505 510 

Glu Leu Lys Leu Val Ala Pro Thr Leu Arg Ser Glu Leu Thr Val Ala 
515 520 525 

Ala Ala Val Leu Val Leu Leu Val He Val He He Ser Leu He Val 
530 535 540 

Leu Val Val He Trp Lys Gin Lys Pro Arg Tyr Glu He Arg Trp Arg 
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Pro Met Gin Leu Pro Tyr Asp Ser Arg Trp Glu Phe Pro Arg Asp Gly 
580 585 590 

Leu Val Leu Gly Arg Val Leu Gly Ser Gly Ala Phe Gly Lys Val Val 
595 600 605 

Glu Gly Thr Ala Tyr Gly Leu Ser Arg Ser Gin Pro Val Met Lys Val 
610 615 620 

Ala Val Lys Met Leu Lys Pro Thr Ala Arg Ser Ser Glu Lys Gin Ala 
625 630 635 640 

Leu Met Ser Glu Leu Lys He Met Thr His Leu Gly Pro His Leu Asn 
645 650 655 

He Vai Asn. Leu Leu Gly Ala cys Thr Lys Ser Gly Pro He Tyr He 
660 665 670 

He Thr Glu Tyr Cys Phe Tyr Gly Asp Leu Val Asn Tyr Leu His Lys 
675 680 685 

Asn Arg Asp Ser Phe Leu Ser His His Pro Glu Lys Pro Lys Lys Glu 
690 695 700 

Leu Asp He Phe Gly Leu Asn Pro Ala Asp Glu Ser Thr Arg Ser Tyr 
7.05 710 715 720 

Val lie Leu Ser Phe Glu Asn Asn Gly Asp Tyr Met Asp Met Lys Gin 
725 730 735 

Ala Asp Thr Thr Gin Tyr Val Pro Met Leu Glu Arg Lys Glu Val Ser 
740 745 750 

Lys Tyr Ser Asp He Gin Arg Ser Leu Tyr Asp Arg Pro Ala Ser Tyr 
755 760 765 

Lys Lys Lys Ser Met Leu Asp Ser Glu Val Lys Asn Leu Leu Ser Asp 
770 775 780 

Asp Asn Ser Glu Gly Leu Thr Leu Leu Asp Leu Leu Ser Phe Thr Tyr 
785 790 795 800 

Gin Val Ala Arg Gly Met Glu Phe Leu Ala Ser Lys Asn Cys Val His 
805 810 815 

Arg Asp Leu Ala Ala Arg Asn Val Leu Leu Ala Gin Gly Lys He Vai 
820 825 830 

Lys lie Cys Asp Phe Gly Leu Ala Arg Asp He Met His Asp Ser Asn 
835 840 845 

Tyr Val Ser Lys Gly Ser Thr Phe Leu Pro vai Lys Trp Met Ala Pro 
850 855 860 

Glu Ser He Phe Asp Asn Leu Tyr Thr Thr Leu Ser Asp Val Trp Ser 
865 870 875 880 

Tyr Gly He Leu Leu Trp Glu He Phe Ser Leu Gly Gly Thr Pro Tyr 
885 890 895 

Pro Gly Met Met Val Asp Ser Thr Phe Tyr Asn Lys He Lys Ser Gly 
900 905 910 

Tyr Arg Met Ala Lys Pro Asp His Ala Thr ser Glu Val Tyr Glu He 
915 920 925 

Met Val Lys cys Trp Asn Ser Glu Pro Glu Lys Arg Pro Ser Phe Tyr 

930 935 940 



9 0 PCT/US92/OO730 

WO 92/13867 



ser Tyr Glu Lys He His Leu Asp Phe Leu Lys Ser Asp His Pro Ala 

965 . 97W 

val Ala Arg Met Arg Val Asp Ser Asp Asn Ala Tyr lie Gly Val Thr 

980 9oS> 

Tyr Lys Asn Glu Glu Asp Lys Leu Lys Asp Trp Glu Gly Gly Leu Asp 

995 1000 1 

Glu Gin Arg Leu Ser Ala Asp ser Giy Tyr lie Il^Pro Leu Pro Asp 

1010 1015 xu^u 

lie Asp Pro val Pro Glu Glu Glu Asp Leu Gly Lys Arg Asn Arg Hi^ 
1025 1030 

ser ser Gin Thr Ser Glu Glu Ser Ala lie Glu Thr Gly Ser Ser Ser 
1045 1050 xu3=> 

ser Thr Phe He Lys Arg Glu Asp Glu Thr II. Glu Asp II. Asp Met 

1060 1065 

Met Asp Asp lie Gly lie Asp Ser Ser Asp Leu val Glu Asp s.r Phe 

1075 1080 xuod 

Leu 

(2)' INFORMATION FOR SEQ ID HO:5: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6375 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(B) STRAIN: lambda gtlO 

fix) FEATURE: 

(A) NAME/KEY: COS 

(B) LOCATION: 129.. 3395 , ^.^ . n + 
D OTHER INFORMATION: /note- "nucleotide number 1 of this 

11 sequence is identical to the nucleotide number 1 
of the previous 4100 long sequence" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TTGGAGCTAC AGGGAGAGAA ACAGAGGAGG AGACTGCAAG AGATCATTGG AGGCCGTGGG 
CACGCTCTTT ACTCCATGTG TGGGACATTC ATTGCGGAAT AACATCGGAG GAGAAGTTTC 
CCAGAGCTAT GGGGACTTCC CATCCGGCGT TCCTGGTCTT AGGCTGTCTT CTCACAGGGC 
TGAGCCTAAT CCTCTGCCAG CTTTCATTAC CCTCTATCCT TCCAAATGAA AATGAAAAGG 240 
TTGTGCAGCT GAATTCATCC TTTTCTCTGA GATGCTTIGG GGAGAGTGAA GTGAGCTGGC 300 
AGTACCCCAT GTCTGAAGAA GAGAGCTCCG ATGTGGAAAT CAGAAATGAA GAAAACAACA 360 
GCGGCCTTTT TGTGACGGTC TTGGAAGTGA GCAGTGCCTC GGCGGCCCAC ACAGGGTTGT 
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TCATCGTCGA GGATGATGAT TCTGCCATTA TACCTTGTCG CACAACTGAT CCCGAGACTC 
CTGTAACCTT ACACAACAGT GAGGGGGTGG TACCTGCCTC CTACGACACC AGACAGGGCT 
TTAATGGGAC CTTCACTGTA GGGCCCTATA TCTGTGACGC CACCGTCAAA GGAAAGAAGT 
TCCAGACCAT CCCATTTAAT GTTTATGCTT TAAAAGCAAC ATCAGAGCTG GATCTAGAAA 
TGGAAGCTCT TAAAACCGTG TATAAGTCAG GGGAAACGAT TGTGGTCACC TCTGCTGTTT 
TTAACAATGA G G TG G TTGAC CTTCAATGGA CTTACCCTGC AGAAGTGAAA GGCAAAGGCA 
TCXCAATGCT GGAAGAAATC AAAGTCCCAT CCATCAAATT GGTGTACACT TTGACGGTCC 
CCGAGGCCAC GGTGAAAGAC ACTCGAGATT ACCAATGTGC TGCCCGCCAG GCTACCAGGG 
AGGTCAAAGA AATGAAGAAA GTCACTATTT CTGTCCATGA GAAAGGTTTC ATTGAAATCA 
AACCCACCTT CAGCCAGTTG GAAGCTCTCA ACCTGCATGA AGTCAAACAT TTTGTTGTAG 
AGGTGCGGGC CXACCCACCT CCCAGGATAT CCTGGCTGAA AAACAATCTG ACTCTGATTG 
AAAATCTCAC TGAGATCACC ACTGATGTGG AAAAGATTCA GGAAATAAGG TATCGAAGCA 
AATTAAAGCT GATCCGTCCT AAGGAAGAAG ACAGTCGCCA TTATACTATT GTAGCTCAAA 
ATGAAGATGC TGTGAAGAGC TATACTTTTG AACTGTTAAC TCAAGTTCCT TCATCCATTC 
TGGACTTGGT CGATGATCAC "CATGGCTCAA CTGGGGGACA GACGGTGAGG TGCACAGCTG 
AAGGCACGCC GCTTCCTGAT ATTGAGTGGA TGATATGCAA AGATATTAAG AAATCTAATA 
ATGAAACTTC CTGGACTATT TTGGCCAACA ATGTCTCAAA CATCATCACG GAGATCCACT 
CCCGAGACAG GAGTACCGTG GAGGGCCGTG TGACTTTCGC CAAAGTGGAG GAGACCATCG 
CCGTGCGATG CCTGGCTAAG AATCTCCXTG GAGCTGAGAA CCGAGAGCTG AAGCTGGTGG 
CTCCCACCCT GCGTTCTGAA CTCACGGTGG CTGCTGCAGT CCTGGTGCTG TTGGTGATTG 
TGATCATCTC ACTTATTGTC CTGGTTGTCA TTTGGAAACA GAAACCGAGG TATGAAATTC 
GCTGGAGGGT CATTGAATCA ATCAGCCCAG ATGGACATGA ATATATTTAT GTGGACCCGA 
TGCAGCTGCC TTATGACTCA AGATGGGAGT TTCCAAGAGA TGGACTAGTG CTTGGTCGGG 
TCTTGGGGTC TGGAGCGTTT GGGAAGGTGG TTGAAGGAAC AGCCTATGGA TTAAGCCGGT 
CCCAACCTGT CATGAAAGTT GCAGTGAAGA TGCTAAAACC CACGGCCAGA TCCAGTGAAA 
• AACAAGCTCT CATGTCTGAA CTGAAGATAA TGACTCACCT GGGGCCACAT TTGAACATTG 
TAAACTTGCT GGGAGCCTGC ACCAAGTCAG GCCCCATTTA CATCATCACA GAGTATTGCT 
TCTATGGAGA TTTGGTCAAC TATTTGCATA AGAATAGGGA TAGCTTCCTG AGCCACCACC 
CAGAGAAGCC AAAGAAAGAG CTGGATATCT TTGGATTGAA CCCTGCTGAT GAAAGCACAC 
GGAGCTATGT TATTTTATCT TTTGAAAACA ATGGTGACTA CATGGACATG AAGCAGGCTG 
ATACTACACA GTATGTCCCC ATGCTAGAAA GGAAAGAGGT TTCTAAATAT TCCGACATCC 
AGAGATCACT CTATGATCGT CCAGCCTCAT ATAAGAAGAA ATCTATGTTA GACTCAGAAG 
TCAAAAACCT CCTTTCAGAT GATAACTCAG AAGGCCTTAC TTTATTGGAT TTGTTGAGCT 
TCACCTATCA AGTTGCCCGA GGAATGGAGT TTTTGGCTTC AAAAAATTGT GTCCACCGTG 
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CCGTGAAGTG GATGGCTCCT GAGAGCATCT TTGACAACCT * CTACACCACA CTGAGTGATG 
TCTCGTCTTA TGGCATTCTG CTCTGGGAGA TCTTTTCCCT TGGTGGCACC CCTTACCCCG 
GCATGATGGT GGATTCTACT TTCTACAATA AGATCAAGAG TGGGTACCGG ATGGCCAAGC 
CTGACCAC6C TACCAGTGAA GTCTACGAGA TCATGGTGAA ATGCTGGAAC AGTGAGCCGG 
AGAAGAGACC CTCCTTITAC CACCTGAGTG A6ATTGTGGA GAATCTGCTG CCTGGACRAT 
&TAAAAAGAG TTATGAAAAA ATTCACCTGC ACTTCCTGAA GAGTGACCAT CCTGCTGTGG 
CACGCATGCG TGTGGACTCA GACAATGCAT ACATTGGTGT CACCTACAAA AACGAGGAAG 
ACAAGCTGAA GGACTGGGAG GGTGGTCTGG ATGAGCAGAG ACTGAGCGCT GACAGTGGCT 
ACATCATTCC TCTGCCTGAC ATTGACCCTG TCCCTGAGGA GGAGGACCTG GGCAAGAGGA 
ACAGACACAG CTCGCA6ACC TCTGAAGAGA GTGCCATTGA GACGGGTTCC AGCAGTTCCA 
CCTTCATCAA GAGAGAGGAC GAGACCATTG AAGACATCGA CATGATGGAC GACATCGGCA 
TAGACTCTTC AGACCTGCTG GAAGACAGCT TCCTGTAACT GGCGGATTCG AGGGGTTCCT 
TCCACTTCTG GGGCCACCTC TGGATCCCGT. TCAGAAAACC ACTTTATTGC AATGCGGAGG 
TTGAGAGGAG GACTTGGTTG ATGTTTAAAG AGAAGTTCCC AGCCAAGGGC CTCGGGGAGC 
CTTTCTAAAT ATGAATGAAT GGGATATTTT GAAATGAACT TTGTCAGTGT TGCCTCTTGC 
AATGCCTCAG TAGCATCTCA GTGGTGTGTG AAGTTTGGAG ATAGATGGAT AAGGGAATAA 
TAGGCCACAG AAGGTGAACT TTCTGCTTCA AGGACATTGG TGAGAGTCCA ACAGACACAA 
TTTATACTGC GACAGAACTT CAGCATTGTA ATTATGTAAA TAACTCTAAC CACGGCTGTG 
TTTAGATTGT ATTAACTATC TTCTTTGGAC TTCXGAAGAG ACCACTCAAT CCATCCATGT 
ACTTCCCTCT TGAAACCTGA TGTCAGCTGC TGTTGAACTT TTTAAAGAAG TGCATGAAAA 
ACCATTTTTG ACCTTAAAAG GTACTGGTAC TATAGCATTT TGCTATCTTT TTTAGTGTTA 
AAGAGATAAA GAATAATAAT TAACCAACCX TGTTTAATAG ATTTGGGTCA TTTAGAAGCC 
TGACAACTCA TTTTCATATT GTAATCTATG TTTATAATAC TACTACTGTT ATCAGTAATG 
CTAAATGTGT AATAATGTAA CATGATTTCC CTCCACACAA AGCACAATTT AAAAACAATC 
CTTACTAAGT AGGTGATGAG TTTGACAGTT TTTGACATTT ATATTAAATA ACATGTTTCT 
CTATAAAGTA TGGTAATAGC TTTAGTGAAT TAAATTTAGT TGAGCATAGA GAACAAAGTA 
AAXGTAGTGT TGTCCAGGAA GTCAGAATTT TTAACTGTAC TGAATAGGTT CCCCAATCCA 
TCGTATTAAA AAACAATTAA CTGCCCXCTG AAATAATGGG ATTAGAAACA AACAAAACXC 
TTAAGTCCTA AAAGTTCTCA ATGTAGAGGC ATAAACCTGT GCTGAACATA ACTTCTCATG 
TAT&TTACCC AATGGAAAAT ATAATGATCA GCGCANAAAG ACTGGATTTG CAGAAGTTNT 

nrmTm tcttcttgcc tgatgaaagc tttggcgacc ccaatatatg tattttttga 

ATCXATGAAC CXGAAAAGGG TCACAAAGGA TGCCCAGACA TCAGCCTCCT TCTTTCACCC 
CTTACCCCAA AGAGAAAGAG TTTGAAACTC GAGACCATAA AGATATTCTT TAGTGGAGGC 
TGGAAGXGCA TTAGCCXGAT CCTCAGTTCT CAAATGTGTG . TGGCAGCCAG GTAGACTAGT 
ACCTGGGTTT CCATCCTTGA GATTCTGAAG TATGAAGTCT GAGGGAAACC AGAGTCTGTA 
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CACGAACTTG CCATGGGAAA CAAATAATTT GAACTTTGGA ACAGGGTTCT TAAGTTGGTG 
CGTCCTTCGG ATGATAAATT TAGGAACCGA AGTCCAATCA CTGTAAATTA CGSTAGATCG 
ATCGTTAACG CTGGAATTAA ATTGAAAGGT CAGAATCGAC TCCGACTCTT TCGATTTCAA . 
ACCAAAACTG TCCAAAAGGT TTTCATTTCT ACGATGAAGG GTGACATACC CCCTCTAACT 
TGAAAGGGGC AGAGGGCAGA AGAGCGGAGG GTGAGGTATG GGGCGGTTCC TTTCCGTACA 
TGTTTTTAAT ACGTTAAGTC ACAAGCTTCA GAGACACATT GGTCGAGTCA CAAAACCACC 
TTTTTTGTAA AATTCAAAAT GACTATTAAA CTCCAATCTA CCCTCCTACT TAACAGTGTA 
GATAGG7GTG ACAGTTTGTC CAACCACACC CAAGTAACCG TAAGAAACGT TATGACGAAT 
TAACGACTAT GGTATACTTA CTTTGTACCC GACACTAATG ACGTTAGTGA CACGATAGCC 
GTCTACTACG AAACCTTCTA ATTATTTCAT GAACTGATGG ATGACCACAT 

TAGAGTTACG TTCGGCGTTG AAAGAATAGG TTGAAAAAGT ATCATTCACG CTTCTGACTC 
GGTCTAACCG GTTAATTTTr CTTTTGGACT GATCCAAGAC ATCTCGGTTA ATCTGAACTT 
TATGCAAACA CAAAGATCTT AGTGTCGAGT TCGTAAGACA AATAGCGAGT GAGAGGGAAC 
ATGTCGGAAT AAAACAACCA CGAAACGTAA AACTATAACG ACACTCGGAA CGTACTGTAG 
TACTCCGGCC TACTTTGAAG AGTCAGGTCG TCAAAGGTCA GGATTGTTTA CGAGGGTGGA 
CTTAAACATA TACTGACGTA AACACCCACA CACACACAAA AGTCGTTTAA GGTCTAAACA 
AAGGAAAACC GGAGGACGTT TCAGAGGTCT TCTTTTAAAC GGTTAGAAAG GATGAAAGAT 
AAAAATACTA CTGTTAGTTT CGGCCGGACT CTTTGTGATA AACACTGAAA AATTTGCTAA 
TCACTACAGG AATTTTACAC CAGACGGTTA GACATGTTTT ACCAGGATAA AAACACTTCT 
CCCTGTATTC TATTTTACTA CAATATGTAG TTATACATAT ATACATAAAG ATATATCTGA 
ACCTCTTATG ACGCTTTTGT AAATACTGTT CGACATAGTG ACGGAAGCAA ATATAAAAAA 
ATTGACACTA TTAGGGGTGT CCGTGTAATT GACAACGTGA AAACTTACAG GTTTTAAATA 
TAAAATCTTT ATTATTTTTC TTTCTATGAA TGTACAAGGG TTTTGTTACC ACACCACTTA 
CACACTCTTT TTGATTGAAC TATCCCAGAT GGTTATGTTT TACATAATGC TTACGGGGAC 
AAGTACAAAA ACAAAATTTT GCACATTTAC TTCTAGAAAT ATAAAGTTAT TTACTATATA 
TTAAATTTCC TTAAG 

(2) INFORMATION FOR SEQ ID NO; 6: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(i) SEQUENCE CHARACTERISTICS : 



(A) LENGTH: 41 base pairs 

(B) OTS: nucleic acid 

(C) STRANDEDHESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECOIZ TYPE: cDNA 

(iii) HYPOTHETICAL: HO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 



(3d) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
TCCTTCGACC TACAGATCAA TTAGCXTCCT GTAGGGGGCT G 
(2). INFORMATION FOR SEQ ID HO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: HO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
ATCACCGTGG TTGAGAGCGG CTAGCTTCCT GTAGGGGGCT G 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 41 base pairs 

(B) TYPE: nuclaic acid 

(C) STRANDEDHESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: HO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 



(XX) SEQUENCE DESCRIPTION: SEQ ID HO:9: 

TACAGACTCC AGGTGTCATC CTAGCTTCCT GTAGGGGGCT G 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 44 base pairs 
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(ii) MOLECULE TYPE: cOHA 

(iii) HYPOTHETICAL; NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
CTCTACATCT TTGTGCCAGA TCCCTAGCTT CCTGTAGGGG GCTG 
(2) INFORMATION FOR SEQ ID NO:Il: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: . 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NOtll: 

CAGATCTCTC AGGGCCTGGT CACCGTGGGC TTCCTCCCTA ATCAT 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 48 base pairs 
■ (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) . TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL; NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CAGATCTCTC AGGGCCTGGT CATCAACGTC TCTGTGAACG CAGTGCAG 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGAHISM: Homo sapiens 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CAGATCTCTC AGGGCCTGGT CTACGTGCGG CTCCTGGGAG AGCTG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENG TH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hobo Sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CAGATCTCTC AGGGCCTGGT CGTCCGAGTG CTGGAGCTAA GT 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOI£CTI£ TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(B) STRAIN: lambda gtio 



(3ci) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCTCCCACCC TGCGTTCTGA ATAACTGGCG GATTCGAGGG G 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) - LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 
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GAACTGTTAA CTCAAGTTCC TTAACTGGCG GATTCGAGGG G 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: YES 

(iv) AHTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(B) STRAIN: lambda gtio 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ATTTCTGTCC ATGAGAAAGG TTAACTGGCG GATTCGAGGG G 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI -SENSE: NO . . 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(B) STRAIN: lambda gtlO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TATGCTTTAA AAGCAACATC ATAACTGGCG GATTCGAGGG G 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(B) STRAIN: lambda gtlO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to nRNA 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGAHISK: Hobo Sapiens 

(B) STRAIK: lambda gtlO 



(Xi) SEQfOENCE DESCRIPTION SEQ ID NO: 20: 
AGCCTAATCC TCTGCCAGCT TGATCTACCC TTTGTACCTC TAGGA 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 48 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cONA to mRNA 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 

(YX) ORIGINAL SOURCE: 

(A) ORGANISM: Hobo Sapiens 

(B) STRAIN: lambda gtlO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
AGCCTAATCC TCTGCCAGCT TGAGCTGGAT CTAGAAATGG AAGCTCTT 
(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acxd 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: YES 
(iV) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(B) STRAIN: lambda gtlO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
AGCCTAATCC TCTGCCAGCT TTTCATTGAA ATCAAACCCA CCTTC 
(2) INFORMATION FOR SEQ ID NO: 23: 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: YES 
(iv) AHTI-SEKSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo Sapiens 

(B) STRAIN: laabda gtio 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
AGCCTAATCC TCTGCCAGCT TTCATCCATT CTGGACTTGG. TC 42 
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CTTft ? TS CT ^TMED IS:' 

1. A platelet-derived growth factor receptor 
(hFDGF-R) fragment of between about 8 and 400 amino acids 
comprising one or more platelet-.derived growth factor (PDGF) 
ligand binding regions (LBR's) from extracellular domains Dl,. 
D2, or D3, wherein said fragment binds a platelet-derived 

growth factor ligand. 

2. A PDGF-R fragment of Claim 1, wherein said 

fragment exhibits an affinity of about 5 nM. 

3. A PDGF-R fragment of Claim 1, wherein said 
fragment comprises at least about 15 contiguous amino acids 
from a domain D3 intra-cysteine region. 

4. A PDGF-R fragment of Claim 1, wherein said 
fragment lacks a transmembrane region. 

5. A PDGF-R fragment of Claim 1, wherein said 

fragment is soluble. 

6. A PDGF-R fragment of Claim l, wherein at least 

one of said LBR's is a domain D3 LBR. 

7. A PDGF-R fragment of Claim 1, wherein at least 
one of said LBR's is from a type B or type A PDGF-R LBR. 

S. A PDGF-R fragment of Claim 1, wherein said 
fragment is a contiguous sequence within Table 1 or Table -2. 

9. A PDGF-R fragment of Claim 1, wherein said 

fragment is selected from the group of formulae consisting of: 

a) Xa-Dm-Xc? 

b) Xa-Dm-Xl-Dn-Xc ; 

c) Xa-Dm-Xl-Dn-X2-Dp-Xc; and 

d) Xa-Dm-Xl-Dn-X2-Dp-X3-Dq-Xc; 

. wherein: 

each of Xa, XI, X2, X3, and Xc is, if present, 
a polypeptide segment lacking a D domain; and 

each of Dm, Dn, Dp, and Dq is, independently 
of one another, selected from the group consisting of Dl, D2, 

D3, D4, and D5. 

10. A PDGF-R fragment of Claim 1, wherein said 
fragment is selected from the group consisting of: 

a) D1-D2-D3; and 

b) D1-D2-D3-D4. 
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11. A soluble human platelet -derived growth factor 
receptor (hPDGF-R) fragment of less than about 400 amino acids 
comprising at least one platelet-derived growth factor (PDGF) 
ligand binding region (LBR) from domain D3 , wherein said 

5 fragment specifically binds to a platelet-derived growth factor 
ligand. 

12. A hPDGF-R fragment of Claim 11, wherein said 
fragment comprises a sequence of at least about 15 contiguous 
amino acids from the intra-cysteine portion of domain D3. 

10 13. A hPDGF-R fragment of Claim 11, wherein said 

fragment is substantially pure. 

14. A hPDGF-R fragment of Claim 11,. wherein said 

LBR is derived from a type B or type A PDGF-R, and further is a 

sequence in Table 1 or Table 2. 
15 15. A nucleic acid sequence encoding a PDGF-R 

fragment of Claim 1. 

16. A nucleic acid sequence encoding a hPDGF-R 
fragment of Claim 11. 

17. A nucleic acid of Claim 15 wherein said 
20 encoding sequence is operably linked to a promoter. 

18. A cell comprising a PDGF-R fragment of 

Claim 1. 

19. A cell comprising a hPDGF-R fragment of 

Claim 11. 

25 20. A mammalian cell comprising a nucleic acid of 

claim 15. 

21. A mammalian cell comprising a nucleic acid of 

claim 16. 

22. A cell comprising both a nucieic acid of Claim 
30 15,. and a protein expression product of said nucleic acid. 

23. An. antibody which recognizes an epitope of a 
PDGF-R fragment of Claim 1, wherein said epitope is not found 
on a natural PDGF-R. 

24. An antibody of Claim 23 , wherein said antibody 
35 is a monoclonal antibody. 

25. A method for measuring the PDGF ligand binding 
activity of a biological sample comprising the steps of: 
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a) contacting an aliquot of said sample to a 
PDGF ligand in the presence of a PDGF-R fragment of Claim 1 in 
a first analysis; 

b) contacting an aliquot of said sample to a 
5 PDGF ligand in the absence of said PDGF-R fragment in a second 

analysis; and 

c) comparing the amount of said PDGF ligand 

binding in the two analyses. 

26. A method of Claim 25 , wherein said PDGF-R 

X0 fragment is attached to a cell . 

27. A method of Claim 26 , wherein said PDGF-R 
fragment is attached to a solid substrate, 

28. A method of Claim 27, wherein said solid 
substrate is a microtiter dish. 

15 29 . A method for measuring the PDGF ligand content 

of a biological sample comprising the steps of: 

a) contacting an aliquot of said sample to a 
ligand binding region (LBR) in the presence of a PDGF-R 
fragment of Claim 1 in a first analysis; 

b) contacting an aliquot of said sample to a 
LBR in the absence of said PDGF-R fragment in a second 
analysis; and 

c) comparing the amount of binding in the two 

analyses. 

30. A method of Claim 29, wherein said contacting 
steps are performed simultaneously. 
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